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NO  PERIOD  TWO  IMPLIES  CONVERGENCE 


WHY  USE  TANGENTS  WHEN  SECANTS  WILL  DO? 


W.  ICahan 


Abstract:  A  familiar  cuk  la  Co  aolva  f'x)  -  0  given  a  continuously  differen 


clabla  real  funcclon 


Navton'a  Iteration  could  ba  triad;  ao  could  the 


Secant  Iteration.  Except  when  the  derivative 


coata  appreciably  leaa  to 


evaluate  than  doea 


the  Secant  Iteration  cenda  In  practice  to  converge 


ultlaately  aora  efficiently  than  Navton'a  whenever  both  lterationa  converge 
to  the  desired  root.  Whan  will  they  both  converge?  We  find  roughly  chat 
whenever  Navton'a  Iteration  converges  froa  every  starting  point  In  an  Interval 


X,  so  aust  the  Secant  Iteration  converge  froa  every  pair  of  starting  points 
In  X  provided  only  that  f  actually  reverses  slg-<  In  X.  Thia  la  an 


unexpected  way  for  the  Secant  Iteration  to  dominate  Newton's 


The  foregoing  discovery  was  uncovered  by  techniques  like  those  which 


produced  the  following  one.  Suppose  aaps  the  closed  finite  Interval 


continuously  Into  itself,  and  hence  lias  at  least  one  fixed  point  x  •  }fx) 


perhaps  several.  The  Iteration 


?ix  )  need  not  necessarily 


converge,  but  it  will  converge  in  I  froa  every  starting  point 


If  and  only  If  no  two  distinct  points  In  I  are  exchanged  by  $.  Therefore 


the  Iteration  converges  only  If  It  cannot  be  trapped  In  a  cycle  of  period  2 


on  the  other  hand  It  la  known  that  the  existence  of  a  cycle  of  "Period  Three 


Iaplles  Chaos"  (T.-Y.  LI  &  J.  A.  Yorke,  Aaer.  Math.  Monthly  82  1975).  These 
last  results  and  aore  can  be  found  In  a  little-known  paper  by  A.  N.  Sarkovakll 


(1964)  "Co-existence  of  cycles  of  a  continuous  aapplng  of  the  line  Into  Itself 
(Russian)  Ukraln.  Mat.  f  16  #1  (Math.  Rev.  28  (1964)  #3121). 


0 

SO.  Introduction 

This  work  Is  presented  In  two  parts.  Part  I  deals  with  the  iteration 
x  ■  $(x  )  when  %(x)  maps  an  Interval  I  continuously  Into  Itself  without 

*1+1  H 

exchanging  two  distinct  points  of  7.  The  theorem  that  then  the  Iteration 

converges  Is  proved  here  not  because  the  theorem  Is  new  (a  proof  by  bashurov 

and  Oglbln  (1966)  has  been  translated  Into  English)  but  because  that  proof 

contains  Ideas  that  will  be  needed  in  part  II  and,  besides,  deserve  to  be 
better  known. 

Part  II  deals  with  Newton's  and  the  Secant  Iterations  for  finding  where 
a  continuously  differentiable  real  function  f(x)  vanishes.  The  results 
presented  appear  to  be  new  and  to  provide  further  incentive,  if  any  be  needed, 
for  preferring  the  Secant  iteration  over  Newton's. 

This  introduction  Includes  a  summary  of  all  subsequent  sections'  contents 
in  order  to  help  the  browsing  reader  locate  more  easily  whatever  Interests  him. 
All  proofs  are  terminated  by  the  □  sign  in  order  to  help  the  casual  reader 
skip  over  them. 

Part  I,  Si  The  No  Swap  Theorem 

The  main  result  is  stated  here  together  with  some  of  its  history,  but 
not  proved  until  54.  The  result  Is  due  principally  to  Tarkovskii.  It 
is  valid  both  on  finite  Intervals  and  on  infinite  Intervals  regarded 
as  line  segments,  but  not  valid  on  the  projectlvely  closed  real  axis 
(regarded  as  a  circle)  except  in  special  cases. 

$2  Two  Conditions  Equivalent  to  the  No  Swap  Condition 

These  are  technical  details  used  only  in  S3. 

$3  The  One-Sided  Condition 

This  condition,  first  articulated  by  Tarkovskii  (1965),  is  the 
most  potent  equivalent  to  the  No  Swap  Condition. 


§4  Proof  of  the  No  Swap  Theorem  and  some  Applications 

Among  the  applications  are  validations  of  familiar  conditions 
sufficient  for  convergence,  and  characterizations  of  the  regions  from 
which  convergence  is  assured.  Newton's  iteration  is  seen  to  be  in  one 
sense  a  special  case,  in  another  sense  more  general  than  the  iteration 
discussed  above.  The  No  Swap  theorem  is  applied  to  show  that  Newton's 
iteration  always  converges  when  used  to  find  a  zero  of  a  rational  func¬ 
tion  whose  poles  and  zeros  are  all  real,  simple,  and  Interlace. 

Part  II,  §5  Newton's  and  the  Secant  Iteration 

This  is  where  significantly  new  results  begin.  The  two  iterations  are 
described,  and  first  mention  is  made  of  a  pathological  discontinuity 
that  must  complicate  matters  (it  partially  invalidates  some  of  the 
claims  In  the  abstract  above)  even  If  we  begin  with  an  infinitely 
differentiable  function  f  x) .  The  Secant  iteration's  sparse  literature 
and  history  are  sketched  briefly,  and  then  five  examples  arc  presented 
to  give  the  reader  sosm  feeling  for  the  possibilities  with  which  our 
theory  must  cope. 

36  Projective  Invariance  of  Newton's  and  the  Secant  Iterations 
Old  but  unfamiliar  results  culminate  in  a  Mean  Value  lemma  which  binds 
the  two  iterations  together  more  firmly  than  can  any  hand-waving  about 
the  Secant  iteration  being  a  "discretization"  of  Newton's. 

S7  Inferences  from  N(I)  C  I 

The  hypothesis  that  N(x)  =  x  -  fix) lf’(z)  maps  an  interval  I  into 
Itself,  a  prelude  to  the  assumption  that  Newton's  iteration  converges, 
has  profound  consequences  ranging  from  the  monotoniclty  of  f  to  the 
Darboux  continuity  of  .V,  all  of  them  needed  in  the  next  section’s 
proofs. 


§8  The  No  Swap  Theorea  for  Newton's  Icerrtion 
Here  are  the  necessary  and  sufficient  conditions  for  Newton's  iteration 
to  generate  from  every  starting  point  a  sequence  of  iterates  of  which 
some  subsequence  converges  to  a  zero  of  f(x).  An  exanple  shows  that 
the  subsequence  complication  is  theoretically  unavoidable  though  prac¬ 
tically  lgnorable.  One  application  of  these  conditions  is  a  generaliza¬ 
tion  of  the  familiar  convexity  conditions  sufficient  for  convergence; 
we  can  allow  at  least  one  inflexion.  Consequently  certain  financial 
calculations  can  be  accomplished  via  Newton's  (or  the  Secant)  iteration 
with  no  need  first  to  obtain  safe  starting  values.  Another  application 
provides  for  rapid  computation  of  all  the  real  zeros  of  a  sufficiently 
differentiable  function  (e.g.  a  polynomial)  with  no  recourse  to  Sturm 
sequences. 

§9  The  Secant  Iteration 

Finally  Part  II 's  main  result,  which  tells  when  the  Secant  iteration 
works  as  well  as  Newton's,  is  stated  accurately  and  proved  via  a  long 
sequence  of  ten  propositions  which  exploit  almost  all  that  has  gone 
before.  A  final  example  shows  once  again  that  subsequence  complications 
are  theoretically  unavoidable  though  practically  lgnorable. 


S10  Bibliography 


PART  I 


51.  The  No  Swap  Theorem 

)(x)  Is  a  function  which  naps  a  closed  finite  Interval  I  continuously 
into  Itself.  Since  x-p(x)  cannot  have  the  same  non-zero  sign  at  both  ends 
of  I  It  must  vanish  at  least  once  In  I;  consequently  $  must  have  at 
least  one  and  possibly  several  fixed  points  x  -  (x )  In  I.  A  natural  way 
to  seek  a  fixed  point  Is  to  Iterate, 

*n+1  ■  for  n  *  0,1, 2, 3,...  ; 

but  the  iteration  cannot  always  be  expected  to  converge.  For  example, 

x  •  s1n(2*rx  )  almost  never  converges,  the  exceptions  being  a  countable 
•tvl  n 

set  of  starting  values  Xq  from  which  one  of  the  three  fixed  points  x  ■  0 
or  x  -  429368...  Is  reached  after  finitely  many  Iterations. 

Theorem:  The  condition  necessary  and  sufficient  for  the  Iteration  “  $(x^ 

to  converge  from  every  x^  In  I  turns  out  to  be 

The  No  Swap  Condition:  So  two  distinct  points  In  I  are  exchanged 
by  J;  l.e.  If  x  -  $($<x))  In  I  then  x  ■  b(x)  too. 


This  theorem  will  be  vindicated  below  in  stages  designed  to  exhibit  Interme¬ 
diate  results  which  will  be  useful  In  Part  II.  But  first  we  digress  to  dis¬ 


cuss  the  condition's  history  and  generality. 

The  No  Swap  theorem  Is  equivalent  to  the  assertion  that  x^+^  “  *i'x  ) 
converges  from  every  Xq  In  I  If  and  only  If  no  Xq  In  I  leads  to  a 
sequence  of  Iterates  cycling  on  two  points  x^n  •  x^  and  x^+1  "  xi  ^  X0" 
This  theorem  appears  (not  altogether  correctly)  In  A.S.  Sarkovskll  (1960,1961) 
and  Is  (correctly)  elaborated  upon  In  subsequent  papers  (1964,1965)  wherein 
Tarkovskii  proves,  among  other  things,  that  the  Integers  can  be  re-ordered 


thus 


3. 5. 7. 9. . . . .21+l, .... 

6.10.14.18. .  . . .4t+2, . . . , 

•  •  •  9 

2*  3, 2*  5, 2*7, 2*9 . 2*(2t+l) . 

...,  and  finally 

...,r  ,r~l . a, 4, 2,i 

(ffl) 

In  such  a  way  chat  if  from  some  i„  ■  x  '  the  iteration  x  ,  -  Kx  )  cycles 

00  n+1  n  1 

(m' ) 

on  m  distinct  points  Chen  there  are  other  starting  values  Xq  •  x^  in  I 
from  which  follow  cycles  of  every  length  subsequent  to  m  in  the  re-order¬ 

ing.  I  am  indebted  to  Prof.  Rufus  Bowen  for  references  to  Tarkovskii's  work, 
which  seems  to  go  beyond  what  has  appeared  recently  in  Che  English  language; 
cf.  Li  and  Yorke  (1975),  Stepleman  (1975),  and  Bashurov  and  Ogibin  (1966). 

In  this  paper  only  tho^e  parts  of  Tarkovskii's  work  that  bear  upon 
Part  II  will  be  repeated. 

The  No  Swap  theorem  is  stated  above  for  a  continuous  map  $  of  a  closed 
finite  (i.e.  compact)  Interval  I  to  itself.  Must  I  be  both  closed  and 
finite?  No;  it  could  be  neither.  But  a  more  general  form  of  the  No  Swap 
theorem  involves  complications  which  obscure  proofs  already  complicated  enough. 
For  Instance,  because  the  No  Swap  theorem  neglects  to  mention  that  the  itera¬ 
tion  x^+^  ■  $(x^),  when  it  converges,  converges  to  a  fixed  point  of  t, 
the  theorem  remains  valid  whether  or  not  I  Includes  its  end-points;  the 
example  $(x)  5  x  on  the  open  Interval  I  =  {0<x<l}  illustrates  the  possi¬ 
bility  of  convergence  to  an  end-point  not  in  I.  The  reader  who  wishes  to 
prove  the  No  Swap  theorem  for  non-closed  Intervals  I  may  do  so  by  first 
adjoining  to  I  any  end  at  which  11m($(xJ  -x)  ■  0  and  then  modifying  in 
routine  ways  every  reference  to  an  end  of  I  in  §§2-4;  fortunately  no  end 
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of  I  Co  which  $  cannot  be  continued  continuously  figures  significantly 
in  the  theorem. 

I  does  not  have  to  be  finite,  but  when  I  is  infinite  ambiguities  can 
arise  concerning  the  meanings  of  "continuous"  and  "convergent",  and  whether 
•  can  be  a  fixed  point  (e.g.  of  $(x)  -  x+1),  and  whether  +*  differs  from 
To  avoid  these  ambiguities  and  other  circumlocutions  in  subsequent  sec¬ 
tions  of  this  paper  we  have  assessed  without  loss  of  generality  that  I  is  a 
finite  interval.  Only  in  this  section.  Si,  do  we  digress  to  justify  chat 
assumption  by  discussing  infinite  intervals. 

The  No  Swap  theorem  remains  valid  when  $  is  a  continuous  map  of  an 
infinite  or  semi-infinite  Interval  I  to  itself  provided  the  word  "converge" 


be  understood  to  Include  possible  "convergence"  to  °°.  This  Is  so  because 
a  suitable  change  of  variables  will  transform  the  infinite  Interval  I  into 
a  finite  one.  For  instance,  suppose  $tx)  maps  I  =  continuously 

to  Itself.  The  new  variables  y  =  (x-l)/(x+l)  and  <D(yJ  =  ($(x)  -  l)/($(x)  +  1) 
3  x  •  (  1-h< ) / ( l-y )  exhibit  the  corresponding  continuous  map  ii  of  the 
transformed  interval  J  £  {-1<^<1}  to  itself.  The  example  $(xJ  =  x+1 
with  an  attractive  fixed  point  at  x  •  +*>  corresponds  to  'U(y)  •  ( l-*-y ) / ( 3 -y) 


with  an  attractive  fixed  point  at  y  ■  1;  just  as  u  ,  -  ty(y  )  converges 

”  n+l  k 

to  1,  so  must  x  -  $(x  )  "converge"  to  +“>.  The  foregoing  change  of 
variables  Is  an  instance  of  a  bilinear  rational  transformation  appropriate 


when  closure  is  not  the  whole  real  axis.  Otherwise,  when  $(D'a 


closure  is  the  whole  real  axis,  non-rational  changes  of  variables  are  more 
appropriate.  One  example  is  y  •  tanh  x  which  maps  the  affinely  closed  real 
axis  I  =  {-®<^x<^+®}  onto  the  closed  finite  intervsl  J  =  {— 1  <  l),  and 

transforms  a  function  $(x)  continuous  at  all  real  x  into 


i l>(yj  =  tanh(  arctanhfy; ;;  which  is  continuous  at  least  in  J's  interior. 


Hence  the  following  ostensibly  more  general  form  of  the  No  Swap  theorem  is  true: 
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If  I  Is  a  line  segment,  Including  or  not  Including  Its  ends, 
finite  or  Infinite,  and  If  $  maps  I  continuously  to  itself, 
then  the  iteration  x>i+^  ■  ♦  7  converges  from  every  x^  In 

I  If  and  only  If  $  exchanges  no  two  distinct  points  of  7. 

A  line  segment,  which  has  two  ends  that  may  or  may  not  be  regarded  as 
part  of  that  line  segment.  Is  quite  different  from  a  circle  which  has  no  end 
at  all.  The  No  Swap  theorem  Is  not  generally  applicable  on  a  circle,  and 
consequently  not  generally  applicable  to  real  functions  $(x)  which,  like 
rational  functions,  are  continuous  In  an  extended  sense  on  the  projectively 
closed  real  axis  I  =  {all  real  numbers  x)  U  {«} .  The  usual  change  of  varia¬ 
bles  Is  the  Stereographic  Projection  (Pig.  1)  v  =  2  arctanfx)  mod  2*  which  maps  the 
projectively  closed  real  axis  I  onto  the  circle  C  =  mod  2*)<tt), 

and  transforms  }(x)  Into  i>(y)  =  2  arctani'.Ktan  y/2))  mod  2n  which  must 
map  C  continuously  to  Itself  whenever  5  is  a  rational  function  or,  more 
generally,  whenever  either  $(x)  is  continuous  or  l/b(x)  Is  continuous  at 
every  real  x,  and  either  or  l/Gd/w)  is  continuous  at  u)  ■  0. 

But  1/  may  lack  a  fixed  point  In  C;  take  for  example  ty(y)  =  u+1  mod  2*, 
the  transform  of  the  rational  function  2<x)  l  tanfi  +  arctan  x) 

-  (x+tan  4)  /  (7  ”  x  tan  -i) ,  which  has  neither  cycle  nor  fixed  point.  Another 

4.  L 

violator  of  the  No  Swap  theorem  Is  1 i(y)  z  -2 y  mod  2ir,  the  transform  of 
2 

d(x)  z  2x/(x  -1),  which  also  swaps  no  two  distinct  points  but  does  have 

three  repulsive  fixed  points  to  which  Iteration  converges  only  from  a  count- 

2 

able  set  of  starting  points.  A  final  example  $(x)  =  x(x+2)/((l-x)(x  +2x+2)) 
never  swaps  two  distinct  points  and  has  just  one  fixed  point  to  which  the 
iteration  x..+  1  •  ♦C xn 7  converges  from  most  starting  points,  but  the  itera¬ 
tion  defies  the  No  Swap  theorem  by  cycling  through  one  set  of  eleven  points 
starting  at  x^  •  -3.2956364...  and  through  another  eleven  starting  at 
Xq  ■  -4.2078536...  (there  are  no  shorter  cycles).  See  Fig.  2. 


% 


Two  useful  special  circumstances  are  known  when  the  No  Swap  theorem  may 
be  applied  to  a  continuous  map  of  the  circle  C  to  Itself.  One  we  have 
already  discussed  Is  the  case  when  ^(C)  is  a  proper  sub-arc  of  C  in  which 
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case  C  may  be  transformed  Into  a  line  segment  on  which  is  continuous 
by  cutting  C  at  any  point  not  in  Such  a  case  Is  exemplified  by  a 

rational  function  $(x)  =  (x“”-l)/ (x^^-l)  with  m  _>  1  that  maps  the 
projectively  closed  real  axis  Into  the  interval  -1  <  $(xJ  £  1  without 
swapping  two  distinct  points,  whence  the  No  Swap  theorem  Implies  that 
x>>+^  “  $(x^/  converges  from  any  x^.  The  second  special  circumstance  arises 
when  $  has  In  C  at  least  one  fixed  point  n  -  ty(r\)  at  which  cutting  C 
yields  a  line  segment  on  which  V  remains  continuous  and  therefore  entitled 
to  the  No  Swap  theorem.  Such  a  circumstance  arises  whenever  never  winds 

past  n,  l.e.  whenever  the  equation  i>(y)  ■  n  has  no  solution  y  other 
than  perhaps  y  •  1  across  which  the  expression  'ii(y)  -n  changes  sign. 

One  example  Is  the  rational  function  $(x)  =  (x^+1) / (x-1) “  whose  unique 
fixed  point  *  Is  approached  from  Just  one  side  as  x  -  1;  consequently 
the  projectively  closed  real  axis  may  be  cut  at  *  to  produce  the  affinely 
closed  real  axis  { -® x  <_ ■♦**> }  which  $  maps  "continuously"  to  Itself  In  a 
way  which  Justifies  the  Inference  from  the  No  Swap  theorem  that  x^  •  ♦  (x^/ 

*  from  every  real  x^.  Another  rational  example  Is  t(x)  z  (x  -l)/(x  -1) 

with  i  >  1  which  can  be  shown  to  map  the  projectivelv  closed  real  axis  one- 
to-one  onto  Itself  without  swapping  two  points;  moreover  0  has  two  fixed 
points,  an  attractive  one  between  2  and  2  and  a 

repulsive  one  between  -2  and  -2^^  w  +  l/2)^  After  the  circle  is  cut  at  the 
latter  fixed  point  the  No  Swap  theorem  implies  that  x^+^  ■  ^(x^  will  con¬ 
verge  through  the  extended  reals  (possibly  including  one  x^  •  *  and  the  next 
x  ■  0)  to  the  attractive  fixed  point  from  every  starting  point  X-  except 


the  repulsive  fixed  point.  This  example  and  the  one  before  last  will  reappear 


in  55. 


Finally,  note  chat  no  generality  la  gained  by  allowing  the  cloaed  Inter¬ 
val  I  mapped  continuously  to  Itself  by  $  to  be  a  subset  of  the  real  axis 
Instead  of  all  of  it,  because  $  could  be  defined  outside  I  by  extending 
9's  graph  horizontally  from  its  ends.  I  Is  significant  only  In  so  far  as 
It  represents  that  part  of  $'s  domain  In  which  Che  No  Swap  condition  Is 
satisfied,  outside  which  Che  condition  might  be  violated.  I's  significance 
will  become  clearer  In  $4  during  the  discussion  of  catchment  basins. 

Here  ends  the  digression  concerning  Infinite  intervals.  Henceforth 
until  §5  assume  I  is  a  closed  finite  Interval  mapped  continuously  to  Itself 
by  ♦. 

52.  Two  Conditions  Equivalent  to  the  No  Swap  Condition 

The  first  such  equivalent  condition  to  be  considered  Is 

"'he  No  Separation  Condition:  No  z  in  I  can  strictly  separate  $(3,1  from 

)<M.  ;  t.e.  either  z  l3!*3  or  1  3  £  *  ^  *($(*))  In 

I  implies 

Since  the  No  Swap  condition  Is  an  obvious  Implication  of  the  No  Separation 
condition,  our  task  is  to  verify  that  the  former  condition  Implies  the  latter, 
so  assume  that  the  No  Separation  condition  Is  violated  by,  say, 

}  i(z)l  <  z  <  $(x!  In  I  and  we  shall  exhibit  a  consequent  violator  v  of 
the  No  Swap  condition.  See  Fig.  3. 

Since  tfx)  -x  takes  opposite  signs  at  x  •  %(*)  and  at  x  •  z,  $<x,' 
must  have  a  fixed  point  y  -  $(y)  strictly  between  z  and  %(z)  .  Similarly, 
as  x  runs  down  from  z  to  I's  left-hand  end-point,  %($(x))-x  runs  from 
a  negative  value  at  x  -  z  to  a  non-negative  value  (t($(x))  maps  I's  left- 
hand  end-point  Into  I),  so  S(t('x))  must  have  a  first  fixed  point 
y  •  4($(V))  <  a;  by  "first"  Is  meant  that  t(t(x))  -x  <  0  for  y  <  x  <  z. 


Can  v  -  $(v)t  No;  otherwise  $(x)  -y  would  be  positive  at  2*2  and  nega¬ 
tive  at  2  -  V,  In  which  case  we  should  have  $(u)  -  y  -  0  at  some  u 
strictly  between  y  and  3  and  then  +($(u))  -  u  •  }(y)  -  u  •  y-u  >  0  con¬ 
tradicting  our  choice  of  V  as  the  first  value  of  2  <  z  for  which 
-  2  >  0.  Therefore  y  •  f*  $i'yj  violates  the  No  Swap 

condition.  □ 

The  second  condition  equivalent  to  the  No  Swap  condition  will  be  called 

The  No  Crossover  Condition:  If  $(v)  <_  u  <  v  <_  $(u)  In  I  then 

•  u  ■  y  •  $1 'u)  too. 

Since  this  condition  obviously  Implies  the  previous  two,  our  task  now  Is  to 
verify  chat  they  Imply  this  one,  which  we  shall  do  by  Inferring  from  a  viola¬ 
tion  av)  <  a  <  V  <  i(u,  of  the  No  Cr~ssover  condition  that  there  exists  a 

violation  z  of  the  No  Separation  condition.  See  Fig.  4. 

Consider  If  $(%(v))  2.  y  Chen  V  violates  the  No  Separation 

condition.  Otherwise,  If  $($(v)'  <  U,  we  plot  %(x)  -y  as  2  runs  through 
9(v)  2.  x  u*  Since  ®: 'xJ  -  y  <  0  at  2  •  )(v)  and  $(x)  -y  >  0  at  2  •  u 

there  must  exist  some  3  in  i(v)  <  z  <  u  with  $(z)  ■  y,  and  this  3 

violates  the  No  Separation  condition  because 

$($(2))  -  $fv)  <  2  <  u  <  V  ■  t(*)  .  □ 

We  shall  use  the  No  Crossover  condition  In  lieu  of  the  No  Swap  condition 
to  prove  inferences  from  the  latter. 
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§3.  The  One-Sided  Condition 

We  return  now  to  the  Iteration  x  ,  -  pix  )  and  show  that  the  No  Swap 

n+1  *1 

condition  Is  equivalent  to  another  condition  first  articulated  by  Tarkovskii 
(1965)  and  used  also  by  Bashurov  and  Oglbln  (1966,  Lemma  2). 

The  One-Sided  Condition:  Whenever  x^  -  P^x^J  i  x^  in  I  all  subsequent 
Iterates  "  SCx^)  also  differ  f roa  x^  and  lie  on  the  same  side 

of  Xq  as  does  x^. 

Since  the  No  Separation  condition  Is  an  obvious  Inference  from  the  One-Sided 
condition,  our  task  Is  to  Infer  the  converse  by  assuming  that  x^  violates 
the  One-Sided  condition  and  deducing  that  some  subsequent  Iterates  violate 
the  No  Crossover  condition. 

Assume  for  definiteness  that  x^  -  P'XqJ  >  x^  but  that  some  subsequent 

Iterate  x  <  xn.  We  mv  take  x  to  be  the  first  such  Iterate  and  then 
-t  —  0  m 

have  x, ,x, . x  all  greater  than  x. .  Next  let  x.  be  the  first  of 

1  i  ft—  1  0  K 


these 


Iterates  to  satisfy  x.  >  x  now  x  <  x_  <  x.  ,  <  x  ,  <  x, 

7  <  -  -?-l  —  0  —  C-l  W-l  —  * 


which  exhibits  x.  ,  and  x  ,  as  violators  of  the  No  Crossover  condition. 
«-i  -?-l 


The  One-Sided  condition  has  been  interpreted  above  as  a  property  which 
an  Iterating  function  Pix)  can  possess  If  and  only  If  i  also  satisfies 
the  No  Swap  condition.  But  the  One-Sided  condition  can  also  be  regarded  as 
a  property  of  sequences  Irrespective  of  their  genesis: 

The  One-Sided  Condition  is  satisfied  by  the  sequence  (x^,x^ .x^, . . . } 
whenever  each  member  X(_  of  the  sequence  lies  on  the  same  side  of  all  sub- 


ft  >  0;  l.e. 

each  x 

n 

satisfies 

X  <  X 

n 

for  all 

m  >  0, 

X  >  X  . 
n  n-*n 

for  all 

ft  >  0, 

or  else  x  •  x 
n  *i 


for  all  ft  >  0. 
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In  particular.  If  cha  sequence  x  .  -  $<x  )  is  generated  bv  a  One-Sided 
Iterating  function  Q(x)  then  the  sequence  must  be  One-sided  too.  Such  a 


sequence  Is  the  subject  of  the  following  1< 


No-Man'S  Land  Lenina:  If  the  sequence  {x^.x^ .x^ , . . . }  satisfies  the  One- 
Sided  condition,  and  If  It  la  not  ultimately  monotonic*,  then  the 
sequence  can  be  partitioned  Into  two  disjoint  Infinite  subsequences 
one  of  which  Increases  strictly  monotonlcally  to  a  limit  x  while 
the  other  decreases  strictly  monotonlcally  to  a  limit  x,  and  x  >  x. 


§4.  Proof  of  the  No  Swap  Theorem  and  some  Applications 

Theorem:  Suppose  I  Is  a  closed  finite  Interval  mapped  continuously  to 

Itself  by  $.  Then  the  Iteration  x^+^  ■  bix^)  converges  in  I  from 
every  x^  in  I  if  and  only  if  any  of  the  following  equivalent  condi¬ 
tions  is  satisfied. 

The  No  Swap  Condition:  if  x  -  bibix))  in  I  then  x  -  bix). 

The  No  Separation  Condition:  Either  bibis))  <_  z  <_  biz)  or  biz)  <_  z  <_  bibiz 
in  I  implies  bibiz))  ■  3  ■  bis). 

The  No  Crossover  Condition:  if  bi'J)  <_  u  <  v  <_  biu)  in  I  then 

biv)  •  U  ■  V  ■  biu) . 

Tho  One-Sided  Condition:  Whenever  x^  •  b(x^)  +  x^  in  I  all  subsequent 

Iterates  x  ,  ■  bix  )  also  differ  from  x.  and  lie  on  the  same  side 
n+1  n  0 

of  Xq  as  does  x^. 

Proof:  The  equivalence  of  these  four  conditions  has  already  been 

established,  and  the  necessity  of  the  No  Swap  condition  is  evident  since 

otherwise  the  iteration  could  cycle  on  two  distinct  points  of  7.  All  that 

is  left  is  to  show  that  these  conditions  suffice  to  ensure  convergence. 

The  sequence  of  Iterates  x^  must  be  a  One-Sided  sequence.  Therefore, 

according  to  the  No-Man's  Land  lemma,  it  is  either  ultlsiately  monotonic  and 

therefore  convergent  in  7,  or  else  it  can  be  partitioned  into  two  disjoint 

infinite  subsequences,  one  ascending  to  a  limit  x  and  the  other  descending 

to  a  limit  x  _>  x.  Is  x  >  x?  No.  Those  Iterates  x^  which  belong  to 

the  descending  subsequence  that  converges  to  x  are  followed  by  Iterates 

x  -  bix  )  which  must,  because  b  is  continuous,  converge  to  bix); 

>1+1  n 

and  yet  at  least  some  of  those  Iterates  x  .  belong  to  the  ascending 
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subsequence  end  must  converge  slso  to  z.  Therefore  z  -  $/z7,  and  simi- 
larly  z  -  $(x) .  Finally  the  So  Swap  condition  Implies  z  •  z  and  the 

V  A 

theorem  is  proved;  z^  •*  z  •  z.  □ 

The  So  Swap  theorem  may  be  applied  in  several  ways  to  decide  whether 
an  iteration  Xn+1  "  *'Xn’  converges.  One  way  is  graphical;  Che  graphs  of 
y  ■  9(xl  and  z  •  $(y)  are  mirror  Images  by  reflection  in  the  mirror 
y  m  x,  and  only  if  those  graphs  intersect  nowhere  but  on  the  mirror  is  the 
So  Swap  condition  satisfied  by  $.  This  technique  was  used  to  settle  a 
conjecture  by  Stepleman  (1975,  p.  894)  that 

z  .  ■  z  (slna/z  7  -  1/8)  ($1nfrad1ans7) 

n+1  n  n 

would  converge  to  zero  from  all  zQ,  or  at  least  from  all  sufficiently 
small  xn.  The  transformation  V  ■  180/ (^z  )  converts  the  iteration  into 

0  rj  y>i 

an  equivalent  form 

X  .  •  x  /(sin/*  7-1/8)  (s1n< degrees  7) 

which  is  easier  to  deal  with  both  numerically  and  graphically.  Then  we 
find  cycles  in  abundance,  for  Instance  -  1355.5094®... 

and  .Y1  -  -1209. 865®... .  or  XQ  -  1723.154°...  and  -Y^  -  -1568.269°..., 
so  Stepleman's  conjecture  is  false.  But  his  iteration  appears  always  to 
converge  in  the  presence  of  roundoff. 

Another  way  to  apply  the  So  Swap  theorem  is  algebraic,  applicable  when 
$(z7  is  a  rational  function.  Then  1  ♦  ($/$Cx7 7  -  &(x) )/ ($/z7  -  z)  la  alao 
a  rational  function,  and  only  if  it  haa  no  zeros  in  T  which  are  not  also 
zeros  of  $'z/-z  is  the  So  Swap  condition  satisfied  by  #.  Therefore  the 
So  Swap  condition  can  be  tested  by  removing  some  common  divisors  from  cer¬ 
tain  polynomials  and  then  invoking  Sturm  sequences  to  decide  whether  the 


polynomials  change  sign  In  I.  The  condition  to(T)  C  7  can  also  be  tested 
by  using  Stuns  sequences  to  see  whether  certain  polynomials  change  sign  In 
I.  The  details  have  been  worked  out  by  R.J.  Fateman  (1977),  who  has  written 
a  computer  program  that  runs  on  M.I.T.'s  MACSYMA  system  and  realizes  the 
following  assertion: 

When  is  a  rational  function  the  question,  whether  x  ■  ) 

pt+1  y\ 

converges  In  7  from  every  x^  In  7,  can  be  decided  by  a  finite 

number  of  rational  arithmetic  operations  without  solving  any  poly¬ 
nomial  equations. 

Ocher  applications  of  the  No  Swap  theorem  Include  easy  validation  of 
conditions  sufficient  for  convergence  in  7  from  every  x^,  three  examples 
of  which  are  these: 

1)  5 luJ  -  $(v)\  <  } la— y |  for  all  distinct  u  and  u  In  7  separated 

by  the  (it  turns  out  to  be  unique)  fixed  point  of  $  in  7's 
Interior. 

11)  -1  <  ($(u)  -  $(v))/(u-v)  for  all  distinct  u  and  V  In  7 

separated  by  one  of  )  's  fixed  points, 
ill)  0  has  In  7  Just  one  fixed  point  that  divides  7  into  at  most 
two  sub- Intervals  at  least  one  of  which  is  mapped  to  itself  by  0. 

In  all  three  cases  I  Is  presumed  to  be  mapped  continuously  to  Itself 
bv  t. 

The  reader  is  asked  to  verify  that  each  of  the  foregoing  three  condi¬ 
tions  Implies  the  No  Swap  condition,  keeping  In  mind  chat  $  must  have  a 

fixed  point  between  any  two  points  of  7  that  $  swaps.  Although  the 

three  conditions  refer  to  $'s  fixed  polnt(s)  the  conditions  do  not  require 
that  any  fixed  point's  location  be  known;  for  example,  if  J>  is  differen¬ 
tiable  one  of  the  inequalities  |$'|  <1  or  >  -1  or  $ '  >  0  respec¬ 

tively  would  suffice. 
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A  troublesome  problem  encountered  frequently  In  practice  arises  when 
conditions  somewhat  like  those  above  are  known  to  be  satisfied  by  0  in 
some  Interval  J  which  is  not  known  to  be  mapped  to  itself  by  0.  This 
problem,  locating  a  suitable  sub-interval  mapped  into  Itself,  is  so  impor¬ 
tant  chat  we  digress  here  to  discuss  how  Che  No  Swap  theorem  can  shed  light 
on  it. 

First  some  terminology.  The  interval  J  is  the  domain  of  a  continuous 

real  function  0(x) .  A  sub-interval  X  of  J  is  called  an  interval 

attracted  to  £  whenever  from  every  x_.  in  X  the  iteration  x  ,  •  0(x  ) 

7  0  n+1  n 

converges  to  Che  fixed  point  £  ■  0(£)  even  though  perhaps  some  iterates 
may  lie  outside  X  (but  all  lie  in  J) ;  we  do  not  insist  that  0(X)  C  X. 

The  fixed  point  £  ■  $(£)  is  called  attractive  whenever  it  belongs  to  some 
nen-ie generate  interval  (one  that  contains  interior  points)  attracted  to  £. 
The  catchment  basin  X,£  belonging  to  an  attractive  fixed  point  £  is 
the  largest  Interval  X  containing  £  and  attracted  to  £.  (When  £  is 
an  end-point  of  its  catchment  basin  most  other  writers  would  call  £  a 
one-sided  attractive  fixed  point  and  reserve  Che  unmodified  term  "attractive 
fixed  point"  for  one  which  lies  in  its  catchment  basin's  interior.)  One 
property  of  X(£)  is  that 


£  e  0(X(£})  C  X(l)  ; 

this  is  true  because  X(£J  is  Just  that  connected  component  containing  £ 
of  the  union  of  all  intervals  attracted  to  £,  while  0(X(£))  is  one  of 
the  intervals  attracted  to  and  containing  £  in  that  union.  That  property 
does  not  characterize  X(£)  because  other  larger  and  smaller  Intervals 
possess  the  same  property;  given  any  sub-interval  K  C  X(£J,  the  smallest 
sub-interval  X  3  (  with  £  €  0(X)  C  X  turns  out  to  be 


X  ■  (convex  hull  ( ( £}  (convex  hull({£}  UK))  C  X(£J 


as  can  be  verified  easily  vlch  Che  aid  of  che  One-Sided  condition  valid 
throughout  X(£J  .  (Cf.  Theorem  2  of  Bashurov  and  Ogibin  (1966).)  Conse¬ 
quently  Che  problem  of  locating  a  suitable  sub Interval  X  mapped  into 
itself  in  <>'£.'  is  reduced  to  the  problem  of  deciding  when  a  sub- interval 
X  lies  between  the  ends  of  <(£,’;  the  crucial  question  is  just 

Where  in  J  are  the  ends  of  £'s  catchment  basin  X(£)  ? 

Lenina :  The  closure  of  X,£)  is  the  largest  closed  Interval  containing  £ 

whose  every  Interior  point  r  satisfies 

1)  x  lies  inside  J,  and 

2)  O'xJ  lies  in  J,  and 

3)  (*f*(*(xJJ)  -  £)/($<x)  -  £)  <  1  if  Ux)  *  £. 

Consequently  the  ends  of  X  £,’  lie  among  those  points  x  which  satisfy 
-1)  x  is  an  end  of  J,  or 
-2)  )(x)  is  an  end  of  J,  or 

-3)  ♦(♦(♦(x);;  ■  Kxj. 

Proof:  <t'£)  can  contain  no  fixed  point  other  than  £  of  $(x)  nor 
of  }($  x,  nor  of  >($•} (x)))  on  the  contrary  the  One-Sided  condition 

satisfied  by  t  throughout  Xi '£)  implies  that  both  of  xJJ  and 

t(p(*(x) ))  must  lie  on  strictly  the  same  side  of  $(x)  as  does  £  provided 
t(x )  +  £,  and  hence  3)  is  satisfied.  Moreover  esch  end  of  X(£)  which  is 
not  an  end  of  J  cannot  be  mapped  by  $  into  X '£_)'»  interior,  but  must 
be  mapped  onto  either  Itself  or  the  other  end  of  X(£),  and  hence  must 
satisfy  -2)  or  -3). 

This  is  a  convenient  place  to  tabulate  the  six  ways  by  which  the  ends 
of  £'s  catchment  basin  X(£)  may  be  recognized.  We  shall  denote  Xf£J's 
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ends  by  n  and  C,  not  necessarily  both  different  from  £. 

Cases  1-3:  C  -  $({.)  is  a  fixed  point  at  one  end  of  X(£)  whose  other 
end  n  is 

Case  1:  an  end  n  of  J  with  In  X(£)'s  Interior,  or 

Case  2:  another  fixed  point  n  ■  $(nJ  i  c,  or 
Case  3:  mapped  onto  C  ■  9(A). 

These  three  cases  are  the  only  ones  which  allow  £  to  lie  at  one  end  (O 
of  X(£j . 

Cases  4-5:  s  lies  strictly  inside  the  closed  Interval  X(£)  one  of  whose 
ends,  C,  Is  an  end  of  J  with  t(Z)  in  X(£)'s  Interior  and  the 
other  end  n  Is 

Case  4:  also  an  end  n  of  J  with  in  X(£)'s  Interior,  or 

Case  5:  mapped  onto  the  first  end  £  ■  Of'nJ. 

Case  6:  £  lies  strictly  Inside  the  open  Interval  X(£)  whose  ends  n  ■ 

and  £  -  9(A)  are  swapped  by  $. 

The  lemma’s  proof  will  conclude  with  demonstrations  appropriate  to  the 
six  cases  that  X(£) ' s  closure  cannot  be  a  proper  sub-interval  of  a  larger 
Interval  X  whose  every  interior  point  satisfies  1),  2)  and  3).  Each  case 
will  be  ruled  out  either  on  the  grounds  that  any  end  of  X(£J  interior  to 
X  would  then  violate  1),  2)  or  3)  or  else  because  some  open  neighborhood 
of  an  end  of  X(£J  Inside  X  would  then  be  attracted  to  £  contrary  to 
Xi'£)’s  definition  as  the  largest  interval  containing  £  attracted  to  £. 

Suppose,  then,  that  X(£)'s  closure  is  a  proper  sub-interval  of  a 
larger  Interval  X  whose  every  interior  point  x  (including  at  least  one 
of  X(£;’s  ends)  satisfies  1),  2)  and  3).  Case  4  Is  ruled  out  by  1), 
case  b  by  3),  and  case  5  by  the  observation  that  any  sufficiently  small 
open  neighborhood  of  n  Inside  X  that  satisfies  2)  must  be  mapped  by  1 
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Into  X(£J  and  therefore  ought  to  belong  to  Xf£J .  Case  1  Is  ruled  out 
either  by  1)  (so  n  is  not  inside  X)  or  3)  unless  C  ■  £;  and  case  2 

persists  only  if  one  of  the  fixed  points  at  the  end  of  X(£J  ,  say  n,  is 

an  end  of  X  too  while  the  other  end  Z  m  £  is  inside  X;  and  case  3 

persists  only  if  Z  m  £  is  inside  X  too.  In  summary,  the  persistent 

cases  are  now  these  (see  Fig.  5): 

Cases  I  '  i  2' :  £  •  $(£)  lies  at  one  end  of  and  inside  X,  but  the 

other  end  n  of  X(£J  is  an  end  of  X  too. 

Case  3':  £  •  lies  at  one  end  of  XfCJ  ,  the  other  end  n  is  mapped 

onto  the  first  £  -  <5(0^,  and  at  least  one  of  these  ends  lies  inside  X. 

To  dispose  of  these  remaining  cases  introduce  V  =  {(XJ  UXf£j,  Evi¬ 
dently  V  C  X(£J's  closure;  otherwise  an  open  neighborhood  around  one  of 
X  £,  's  ends  would  be  mapped  into  X(Z,}  and  would  consequently  be  attracted 
to  £  contradicting  X(£)'s  maxlmalitv.  In  fact,  every  open  neighborhood 
around  one  of  Xi'£y‘s  ends  inside  X  must  be  mapped  by  4  onto  an  Interval 
part  of  which  lies  outside  X(£;’s  closure,  and  since  each  end  of  Xf£J 
inside  X  is  mapped  onto  £  we  conclude  that  ^  contains  some  open 
neighborhood  around  £.  Moreover,  every  y  +  £  inside  that  neighborhood 

V  must  satisfy 

3’)  U(4(y)J -£)/<*-£)  <  1 

either  because  y  lies  in  Xf£^  where  ♦  is  One-Sided  or  because  y  •  %(z) 
for  some  r  in  X’s  interior  where  3)  is  satisfied.  If  N  be  chosen 
small  enough  to  keep  %(b(y))  no  farther  from  £  »  $(&(£))  than  n, 
inequality  3’)  above  will  force  the  iteration  *  WQ(x^))  to  converge 

to  £  from  any  in  V  either  monotonically  through  V  outside  Xf£; 


or  ultimately  monotonically  within  X(£,J.  (Note  that  0  <_  {tlx)  -£)/(x-£)  c  1 
for  all  x  inside  X(£)  in  all  three  cases  1',  2'  and  3'  because  then  t 
has  no  fixed  point  inside  X(£; .)  Since  t  is  continuous  at  £  the  itera¬ 
tion  x,.+^  ■  J  converge  to  £  too,  and  from  any  x^  in  N, 

contradicting  X(£j's  maxlmality  again.  □ 

The  reader  may  well  wonder  why  the  lemma's  condition  3)  was  not  written 
In  the  simpler  form 

3’)  Itltlx))  -  £)/(x-£)  <1  if  x  i  £  . 

One  reason  is  that  the  leans  so  modified  could  be  contradicted  by  a  counter¬ 
example  falling  into  case  3'  in  the  proof;  take  £  ■  0,  tlx)  -  x(l-x), 
n  •  1,  and  the  positive  real  axis  for  X  in  which  3')  is  satisfied.  Ano¬ 
ther  reason  is  that  conditions  like  3)  have  already  appeared  in  the  litera¬ 
ture;  one  Instance  is 

Theorem  3.2  of  Stepleman  (197$,  p.  891):  Suppose  tlx)  is  continuous 

throughout  some  interval  containing  in  its  interior  a  fixed  point 

£  ■  $(£7  but  no  other  solution  x  of  tlx)  •  £;  then  £  is  a  point 

of  (two-sided)  attraction  for  the  iteration  x  ,  •  tlx  if  and 

n+1  n 

only  if 

tlt(tlx)  ))  -  £  i  /  \  tlx)  -  £|  <  1  whenever  tlx)  i  £ 

throughout  some  open  neighborhood  around  £. 

(The  hypothesis  that  tlx)  i  £  whenever  x  +  £  in  the  interval  cannot  be 
dropped  without  abandoning  the  theorem  to  counter-examples  which  answer 
negatively  Stepleman's  "open  question"  following  his  Exar-le  3.3  on  p.  891. 
Incidentally,  that  example  is  wrong.) 
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None  the  less,  the  leone  is  defective  for  practical  purposes  In  so 
far  as  it  sacrifices  Applicability  to  Elegance.  Equation  -3)  could  better 
by  replaced  by  the  pair 

-3')  Q($(x))  ■  x  or  $( $(x) )  •  $(x)  , 


and  inequality  3)  by  the  pair 


3") 


:  ;  x  -  £ 
-  _  r 


<  1 


anJ 


i  i  x,1  •  -  •;  < 

t  X)-', 


1 


if  *(x)  i  £ 


that  attest  to  the  One-S idedness  of  $  in  The  reader  is  asked  to 

verify,  by  retracing  Its  proof,  the  truth  of  the  lenraa  when  3)  and  -3)  are 
replaced  respectively  by  3")  and  -3').  noting  that  when  3")  Is  satisfied 
throughout  a  non-degenerate  interval  containing  £  so  too  must 
($(x)  -  £)/(x-£)  <  1  at  everv  x  +  £  in  that  interval.  Replacing  -3)  by 
-3')  improves  the  leona  because  the  latter's  solution-set  is  never  larger 
and  often  smaller  than  the  former's,  and  therefore  -3')  usually  costs  less 
to  solve  than  -3).  And  inequalities  3")  cost  less  to  test  than  3)  partly 
because  x) .  is  such  a  aess.  Finally,  the  modified  letmaa  is  accom¬ 

panied  by  convergence  theorems,  analogous  to  Stepleaan's  but  simpler,  of 
which  the  following  is  an  example. 

Corollary:  r  «  is  an  attractive  fixed  point  of  the  continuous  real 

function  t  if  and  only  if  3")  is  satisfied  throughout  some  non- 
degenerate  interval  containing  £;  and  £  is  attractive  if  it  lies 
strictly  inside  an  interval  throughout  which  3")'s  first  inequality 
is  satisfied. 

Proof:  3")  is  necessary  because  t  must  be  One-Sided  throughout  Xf£J. 

That  3")  is  sufficient  too  will  follow  after  we  find  an  Interval  I  with 


: 


£  €  $(l)  c  I  throughout  which  ($('xJ  -0/(x-£)  <1  If  x  i  £.  Then  $ 

will  have  no  fixed  point  In  I  except  £  and  will  satisfy  the  No  Swap 

condition,  and  the  Iteration  x  ,  •  }(x  )  will  have  to  converge  to  £ 

n+1  n 

from  every  x^  inside  I. 

Let  X  be  the  non-degenerate  Interval  containing  £  mentioned  In  the 

corollary  and  partition  X  into  two  sub-intervals  which  have  only  £  in 

common;  say  X  -  X^UX^  w^ere  X^  1  (x  ^  £  In  X}  and  X^  5  {x  >_  £  In  X} 

and  at  least  one  of  X^  or  X^  has  Interior  points.  3")’s  first  Inequality, 

valid  inside  X^  and  X^,  prevents  any  point  of  X  except  £  from  being 

a  fixed  point  of  };  consequently  (Oi'xJ  -  £)/(x-£)  -  1  cnnnot  change  sign 

inside  X  or  X  and  must  be  negative  to  avoid  contradicting  3")'s  first 
L  X 

Inequality  In  some  neighborhood  of  £.  Thus,  If  X  D  $(XJ ,  or  If  non¬ 
degenerate  XL  D  KX^J  or  XR  2  >^X^J ,  the  choice  of  I  that  completes 
the  proof  Is  obvious.  Otherwise  there  are  two  cases  to  consider. 

The  boundary  case;  Either  x  •  {£}  or  x  -  {£}.  Sav  the  former;  then 

L  X 

r  lies  at  one  end  of  the  non-degenerate  Interval  X^  but  $fXR,>  S. 
However,  —  (SuD  X  *n  X^)  because  (Q(xi  -  £)/(x-£)  <  1  Inside 

X^,  and  also  Xj^.’ )  £  (SUO  x  In  X^)  because  of  3")'s  first 

Inequality.  On  the  other  hand,  >  (inf  x  In  >(X^)  because 

of  3")'s  second  inequality,  which  implies  ($(x)  -  £)/(x-£)  <  1  Inside 
®i'XRJ.  Choose  I  -  UXR  t0  coinP1*te  X*1*  proof. 

The  Interior  case:  £  lies  Inside  x,  and  only  the  first  of  3")’s  Inequali¬ 
ties  Is  assumed  by  hypothesis  to  hold  Inside  X^  and  X^.  We  have 

r*. 

already  dealt  with  the  possibilities  t  X)  C  X,  t'X  )  C  X  or 

la  la 

d(X  )  c  X  ;  the  only  possibility  left  that  Is  compatible  with  the 

X  —  X 

Inequality  (t?x)  -  £)/(x-£)  <  1  valid  Inside  X^  and  Xr  is  either 
X  C  }(X  )  or  X  C  tfX  / .  Say  the  former;  then  let  the  non-degenerate 

U  K  K  la 
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interval  V  be  that  component  of  $-17X  )  OX  containing  £,  and 

La  K 

let  l  z  X  UV  ■  )(V}UV  C  X,  observing  that  $(I)  C  J  because  of 

L  1 

3*')'s  first  Inequality  valid  inside  V.  □ 


Here  ends  the  discussion  of  the  So  Swap  theroem  for  arbitrary  continuous 

Iterating  functions  $.  The  rest  of  the  paper  concerns  Newton's  and  the 

Secant  iterations  for  solving  an  equation  ?tx)  ■  0.  Contrary  appearances 

notwithstanding,  Newton's  iteration  x  •  x  -  f(x  )lf'(x  )  is  not  Just  a 

*1+ 1  *1  *1  *! 

special  case  of  the  previously  studied  Iteration  x  ,  -  fi'x  ). 

n+1  n 


Prooosi  tion:  Newton's  Iteration  is  ubiquitous;  if  $i'xj  maps  the  finite 
closed  Interval  I  continuously  into  Itself,  and  if  }  has  just  one 
fixed  point  C  •  9(Z)  in  I,  then  9<x!  •  x  -  f(x) / f*  (x)  for  some 
function  f  which  is  continuous  in  I,  vanishes  only  at  C  in  I,  < 
and  is  continuously  differentiable  in  I  except  possibly  at 


In  fact  fx)  •  x  exp  ^  iu.7  (.'X  -  9(vJ)  where  the  constants  si  0  and  the 
lower  limit  of  integration  are  assigned  different  values  for  x  on  one 
side  of  ;  than  on  the  other.  The  hypotheses  concerning  9  ensure  that 
x-  i  x  has  alwavs  the  same  sign  as  x-;,  so  the  integral  is  properly 

defined  for  all  x  i  *  in  I  and  f(x)  *  0  as  x  -*  If  f  has  non¬ 
zero  one-sided  derivatives  at  x  ■  ;  the  constants  may  be  altered  if 
necessary  to  sake  f  continuously  differentiable  at  x  •  C  too.  G 


More  generally,  the  fixed  points  of  9(x)  •  x-f(x)/f(x)  that  are 
not  zeros  of  /  turn  out  to  be  places  where  f’(x)  •  *.  Rather  than 
digress  into  generalities,  let  us  consider  the  following  useful  application 
of  the  No  Swap  theorem  to  Newton's  Iteration. 
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Suppose  ffx)  Is  a  rational  function  whose  poles  and  zeros  are  all 

real,  simple,  and  Interlace,  with  one  pole  at  x  -  <*>.  Such  a  function  has 

n 

the  form  ffx)  ■  j(x-3-  /  ui . / (x-n . ))  with  e  i  0  and  all  u>.  >  0  and 

1  : 

it,  <ir_  <•••<»;  or  It  mav  have  the  form 
12  n 

ffx)  -  det(xi-4)/det(x!-4) 

where  .1  Is  an  hermit lan  matrix,  A  Is  obtained  by  striking  off  4's  last 
row  and  column,  and  the  f's  are  identity  matrices.  Such  functions  f  play 
important  roles  during,  for  example,  the  calculation  of  eigenvalues  of 
hermitian  matrices  4;  cf.  Y.  Saad  (1974).  Now  we  shall  see  why  Newton's 
iteration  almost  always  converges  to  a  zero  of  f ;  we  shall  find  that  con¬ 
vergence  to  a  zero  can  be  precluded  only  If  x^  is  one  of  a  countable 
sparse  set  of  starting  points  from  which  the  Iteration  x  ,  ■  x  -  f'x  )/f'(x) 
terminates  at  a  finite  pole  *r .  of  f  after  finitely  many  steps. 

n+1  n 

Proof:  Write  ffx)  =  a  3  (x-C ..) /TUx-t ;)  where 

1  J  1 

C  <  <  Cj  <  n2  <  <  ^  displays  the  Interlacing  poles  * .. 

and  zeros  l _■  of  f.  Useful  equivalent  forms  for  f  are 

•t  *:+l 

ffx)  -  c(x-8-[  u ,./<*-*.))  -  d  l  y ./(x-c.) 

1  1  1  1  1  1 


1  v5t 


where  3  -  l  n.  -  £  <a.  -  IT  ,)  (C  • 


TT  v 

...  IT. -IT. 
./♦I  1  v 


J-l  ’’r1 

y.  •  II  7* — ■—  IT  ■  —  ■ .  y  >  0  and  £  y.  •  1.  Corresponding  fonts  for 
“  i-l  1  1  J 

the  Iterating  function  $fx7  =  x  -  ffx) /f*(x)  are 


i  x)  -  ; 3  ♦ 


n  u».(2x-,r .)■»  r  *:  w.  i  rn-fl  y,t.  •>  rn-fl  y.  ■> 

I  -i — r '  *♦!  — H  -  fl  -^  /( I  — ‘Li] 

1  (x-^.)z  l  1  (X“TT .)*'  1  1  (x-c.r*  V  1  (x-c.)  > 

4»  *•  a/  J 
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from  which  follow  loawdlately  the  conclusions  chsc  Che  rational  function  $(x) 
is  continuous  for  all  real  has  fixed  points  -  JW'tJ  and  u.  •  J, 

V  J  Jo 

and  maps  the  whole  real  axis  onto  the  interval  C,  <  $(x)  <  c  Conse- 

quently  Newton's  iteration  x^^  •  ♦  Cx^i  will  converge  from  every  real  x^ 

if  and  only  if  $  swaps  no  two  distinct  values  x  and  y.  But  if  $  did 

swap  thea  we  couli  rearrange  Che  equations  x  ■  }(y)  and  y  •  $(x) ,  subtract, 

n+1 

divide  out  (x-y) ,  and  infer  that  £  u.((x-r.)  i  +•  (x-C.f)”  (J/-C  .)_i  ♦  (#-£.)”  ) 

j  w  •>  *  «  J 

-  0  when  in  fact  it  must  be  positive.  Therefore  the  iteration  aust  converge 
to  one  of  }'s  fixed  points.  Since  each  zero  C.  is  a  strongly  attractive 

J 

fixed  point  (I  V;  J  -  0)  but  each  pole  n.  is  strongly  repulsive  ($'(*  J  •  2), 

o  o  j 

convergence  to  a  pole  can  occur  only  "by  accident"  after  finitely  many 
iterations,  and  even  then  rounding  errors  are  likely  to  Intervene  in  our 
favour  and  deflect  the  Iteration  to  converge  to  an  attractive  fixed  point, 
a  zero  of  f.  □ 
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PART  II 

55.  Newton's  and  the  Secant  Iterations 

I  is  again  a  closed  finite  Interval  in  which  we  now  seek  a  zero  C 
of  a  real  function  f(x )  that  is  continuous  in  I  and  continuously  once 


differentiable 

too  except  possibly  at 

its 

zero  C. 

The 

search 

for  C  begins 

at  one  or  two 

starting  approximations 

xo 

and  x^ 

in 

I  and 

attempts  to 

improve  them  via  one  of  the  following 

lterat ions ; 

Newton' s 

x  ,  -  !t(x  ) 
n+1  n 

for 

n  -  0,1,2 

,3,. 

. .  or 

Secant 

xn+i  “  S<sn*xn-i> 

for 

**  •  1.2,3 

•  • 

where,  as  illustrated  in  Figs.  6  and  7, 

X(x)  =  x  -  f(x)  / f*  (x)  if  f'x)  +  0  , 

5  X  if  f'x)  ■  0  no  matter  what  happens  to  f(x)  , 

S(x.y)  Z  x-f(x)(x-y)/(f(x)-f(y))  =  S(y,x)  if  y  *  x  and  f(x)  +  0  , 

=  ’i(x)  if  y  ■  x  or  ffx)  •  0  . 

Whether  either  iteration  converges,  and  which  iteration  is  the  better,  are 
loportant  questions  without  simple  answers;  but  the  following  theorem  sheds 
some  light  upon  them. 

Theorem:  If  X^x)  is  continuous  in  I,  and  if  Newton's  iteration  converges 

in  I  from  every  starting  point  x^  in  I,  and  provided  f  not 

merely  vanishes  but  actually  reverses  sign  across  its  zero  C  in  I, 

then  the  Secant  iteration  also  converges  in  1  from  every  pair  of 
starting  points  x^  and  x^  in  7. 


Before  embarking  upon  the  theorem's  proof  we  shall  digress  first  to  discuss 
the  almost  superfluous  continuity  requirement  upon  .V,  second  to  expose 
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some  of  Che  Secant  Iteration's  history,  and  third  to  explore  some  examples. 

When  -V  is  continuous  in  I  then,  for  reasons  exposed  in  §7  and  §9, 
the  theorem's  hypotheses  imply  that  5  is  continuous  coo  in  I  *  I .  This 
is  the  normal  state  of  affairs  and  arises,  for  example,  when  f  is  twice 
differentiable  in  I  and  f"  reverses  sign  therein  only  finitely  often 
(see  §8's  corollary's  proof).  However,  even  if  f  is  infinitely  differen¬ 
tiable  in  I,  .V  can  be  discontinuous  in  I;  an  example  is  given  in  §7. 

.V  can  be  discontinuous  only  at  C,  and  then  only  if  /'  takes  values 
arbitrarily  close  to  zero  in  the  neighborhood  of  C .  Despite  this  discon¬ 
tinuity  of  .7,  if  it  occurs,  the  theorem  above  perseveres  nearly  unchanged 
as  follows. 

I - 1 

Theorem:  If  Newton's  iteration  generates  from  every  x^  in  I  a  sequence 

of  iterates  {x  }  of  which  some  subsequence  converges  to  and 

provided  f  not  merely  vanishes  but  actually  reverses  sign  across 
its  zero  C  in  I,  Chen  the  Secant  Iteration  also  generates  from 

I 

every  x^  and  x^  in  I  a  sequence  of  Iterates  of  which  some  sub¬ 
sequence  converges  to 

This  statement  of  the  Theorem  includes  the  previous  version  above  for 
reasons  exposed  in  §8  where  conditions  necessary  and  sufficient  for  conver¬ 
gence  of  all  Newton  iterates  or  a  subsequence  of  them  are  exhibited. 

The  possible  discontinuity  of  .V  complicates  proofs  but  can  have  no 
practical  consequences  if,  as  is  customary  in  well-designed  computer  programs, 
two  criteria  are  used  to  decide  when  to  terminate  an  iteration  designed  to 
calculate  a  zero  £  of  f.  Either  stop  when  f(x~)  is  negligible,  and 
then  accept  x  as  the  approximation  to  S;  or  stop  when  several  successive 

iterates  . . . ,x  ,,x  , ,x  differ  from  each  other  negligibly  and  the  values 

*1*1  *1 

. . . ,/Yx  ,7,/Yx  ,7,/Vx  )  are  not  all  of  the  same  sign,  and  then  accept  x  . 

*!—  L  *1—  1  *1  “ 


34 


These  criteria  are  equally  applicable  when  only  a  subsequence  of  the 
iterates  converges  to  C.  Hence,  for  practical  purposes  the  theorem  says 
roughly  that  whenever  Newton's  Iteration  must  succeed  in  finding  a  zero  C 
at  which  f  reverses  sign,  so  must  the  Secant  iteration  succeed. 

Compared  with  Newton's  Iteration,  the  Secant  Iteration  has  a  sparse 

literature.  For  a  long  time  the  Secant  Iteration  x  ,  ■  S(x  ,x  ,)  was 

*1+1  K  ft— 1 

not  distinguished  from  the  REGULA  FALSI  x  ,  -  S(x  ,xn) ,  a  far  slower 
procedure.  Consequently  numerical  analysis  texts  used  to  give  it  short 
shrift,  favouring  Newton's  Iteration  Instead;  for  Instance  take  the  volte- 
face  between  the  first  and  second  editions  of  "Modern  Computing  Methods" 

(1957  and  1961).  The  Secant  Iteration  was  first  used  on  the  earliest  elec¬ 
tronic  computers  because  their  users  calculated  that  a  simpler  (no  need  to 
compute  a  derivative)  but  possibly  slow  method  executed  on  an  electronic 
computer  will  usually  yield  a  correct  answer  sooner  than  a  complicated  but 
faster  (fewer  iterations)  method  executed  by  hand.  The  first  person  to 
realize  that  both  Newton's  and  the  Secant  Iterations  run  at  comparable  speeds 
when  both  are  executed  on  the  same  computer  appears  to  have  been  David  Wheeler 
who  (according  to  Wilkes,  1966)  modified  the  Secant  iteration  cleverly  to 
serve  as  a  fast  general-purpose  zero-finder  on  one  of  the  first  electronic 
computers,  EDSAC  I  at  Cambridge;  see  program  F2  in  Wilkes,  Wheeler  and  Gill 
(1951).  We  shall  not  digress  into  Wheeler's  method  beyond  listing  its  order 
of  convergence  3*^  ■  1.442...  published  by  Wilkinson  (1967)  and  by  Dowell 
and  Jarratt  (1971  —  what  they  call  "the  Illinois  Algorithm"  is  the  ILLLAC  I 
program  transcribed  from  Wheeler's  after  he  visited  the  University  of  Illinois 
in  the  early  1950s).  See  also  Dahlquist,  BJorck  and  Anderson  (1974,  pp.  231-3). 
Wheeler's  program  is  still  widely  used,  for  example  as  program  STD14B  distri¬ 
buted  with  the  Hewlett-Packard  shirt-pocket  calculator  HP-65  (1974).  Better 
programs,  faster,  more  reliable  and,  alas,  more  complicated,  have  been 
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devised  recently  by  Brent  (1973)  and  by  Bus  and  Dekker  (1974). 

Most  of  the  Secant  iteration's  literature  dwells  upon  its  local  conver¬ 
gence  properties.  For  instance,  any  iteration  has  an  order  of  convergence 
defined  as 

11m  Inf  (-ln|x  -cl )1/l  >  1  ; 

the  greater  its  order  the  faster  its  convergence.  In  the  usual  case  when  f 
is  a  smooth  non-linear  function  with  a  simple  zero  C.  i.e.  f(Q)  •  0  ^  f’l £) 
and  f”  C-  0.  Regula  Falsi  has  order  1,  Newton's  iteration  2,  and 
the  Secant  iteration  (l  +  »5)/2  -  1.618...  .  This  last  number,  first 
derived  by  Bachmann  (1954),  does  not  imply  that  the  Secant  iteration  is 
slower  than  Newton's;  on  the  contrary,  as  pointed  out  by  Ostrowski  (1960  et  seq . ) , 
by  Traub  (1964),  and  (briefly)  by  Dahlquist,  Bjorck  and  Anderson  (1974), 
the  Secant  iteration  is  usually  the  faster  unless  the  time  consumed  com¬ 
puting  f' lx)  adds  less  than  half  to  fix}' s  computation. 

Conditions  sufficient  to  ensure  the  Secant  iteration's  convergence  are 
found  in  survey  texts  like  Ostrowski' s,  Traub’s  or  Ortega  and  Rheinboldt ' s , 
and  susanarized  In  Householder  (1970)  or  Dahlquist,  Bjorck  and  Anderson  (1974). 
Characteristic  of  all  such  sufficient  conditions  in  the  literature  known  to 
this  writer  is  that  they  also  suffice  to  ensure  Newton's  iteration's  conver¬ 
gence.  This  characteristic  might  suggest  that  Newton's  iteration  converges 
whenever  the  Secant  iteration  does,  but  the  facts  are  contrariwise  as  stated 
in  the  Theorem  above  and  illustrated  by  our  first  example  A  below. 

The  following  five  examples  A  to  E  all  have  only  f(0)  -  0,  so  C  ■  0, 
and  either  f'-x)  =  -fix)  or  else  ff-x)  =  fix).  Moreover,  f  is  everywhere 
at  least  once  continuously  differentiable. 
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Example  A:  f(x)  =  23x  -  10x3  ♦  3x5  . 

Despite  that  f(x)  is  strictly  monotone  increasing  for  all  x,  and  despite 

that  the  Secant  iteration  converges  to  C  from  every  real  x^  and  x^, 

Newton's  iteration  will  converge  from  x^  if  |xq|  <  r '23/27  •  .922958207... 

but  alternates  with  x  -  (-l)nx  if  x.  -  i/23/27  .  Worse,  if 

n  o  u 

Z23/27  <  x.  <  1.06977829...  then  <-l)"x  -  1. 

U 

Example  B:  f(x)  =  x(4  +  /5  +  |x|)/(l  +  (4  +  /5)|x|)  . 

Despite  that  fix)  is  strictly  monotone  increasing  for  all  x,  the  Secant 

iff 

iteration  fails  to  converge  but  cvcles  Instead  on  four  points,  x  ■  (-1)  x  , 
from  xQ -  2  +  *5  -  4.  23606797. . . ,  x^  -  1,  x^  -  -xQ,  x3  - -x^^ ,  ...  ;  otherwise 
I  conjecture  that  the  Secant  Iteration  converges  from  almost  all  starting 
points.  Newton's  Iteration  fares  worse;  it  converges  provided 
!xgi  <  x  =  ( 16  +  7*5  -  2/95  +  56/5) /II  ■  .  179351475...  ,  but  otherwise  diverges 
to  a  cycle  on  two  points,  with  (-1)  x  ■*  1 1 16  +  7/5  +  2/95  +  56/T) /II 

K 

•  5.57564413...  unless  (-D^x  -  i±  . 

n 

The  next  three  examples  illustrate  what  can  happen  when  f  vanishes 
at  ;  but  does  not  reverse  sign,  thereby  vindicating  the  proviso  in  the 
Theorem  above.  The  change  of  sign  can  be  essential  for  the  Secant  itera¬ 
tion's  convergence  though  irrelevant  for  Newton's  because  the  latter  is 
unchanged  when  f(x)  is  replaced  by  fix),. 

Example  C:  fix)  =  x"*^  for  Integer  m  1  . 

This  example's  analysis  is  facilitated  by  the  observation  that  first 

x  .  »  Six  )  is  tantamount  to  x  , , /x  -  1) 

n  »Tt1  v\ 

and  secondly 

Xn+1  "  Sfxn'*n-1}  19  t0  Xr^\,Xn  "  ♦'Wl' 
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where  $(y)  =  (v  -1)  has  been  discussed  at  the  end  of  §1  above. 

Newton's  iteration  converges  to  £  ■  0  from  every  x^.  So  does  the 

Secant  iteration  when  ft  is  even,  but  when  m  is  odd  (then  /  does  not 

reverse  sign)  the  iteration  converges  from  almost  all  Xq  and  x^.  The 

exceptions  are  first  when  x^/x^  coincides  with  the  negative  fixed  point 

of  i,  in  which  case  (-1)  x,  diverges  monotonically  to  infinity,  and 

secondly  when  x^/x^  coincides  with  one  of  a  countable  set  of  values  from 

which  will  follow  an  x  “  JO,  x  ■  -x  ,,  x  ,  ■  fx  .  / ("H-l )  and 

subsequentlv  x  .  0  as  j  -  <*. 

’t+J 

Example  D:  f(x)  =  x2 (7  -  2/5  +  x|)  /  (l  +  (7  -  2,T)x2)  . 

Newton's  iteration  converges  from  every  x^.  The  Secant  iteration  converges 
if  x^Xq  >  0  and  |x^|  <  .365966339...  but  otherwise  is  likely  to  tend  to 
cycle.  One  cycle  on  four  points  is 


xQ  -  2  +  . 5  -  4.236067977. .. ,  x^ 


r  n+2-i 


(-dV 


Another  is 


x0  -  7-7019690....  x;  -  -1.81818824.  . .,  *2  - -*Q.  Xj.-x^  x^  -  (-1)\  . 

and  this  cycle  is  stable  and  attractive  with  a  contraction  factor  per  cycle 
near  .278  . 


Example  E:  f(x)  =  1.0  -  3/(3+x  )  . 

Newton's  iteration  converges  to  Q  •  0  from  any  x^  with  x‘  <  9,  oscil¬ 
lates  with  x^  -  (-D^Xq  if  Xq  -  i3,  and  diverges  alternatlngly  to  t<*> 

2  2 
if  Xq  >  9.  The  Secant  Iteration  converges  if  x^x^  1.  0  and  both  x‘  <  3 

and  x‘  <  3  but  frequently  diverges  otherwise,  and  certainly  diverges  when 

Xg  2.  9  and  1.  9;  it  can  cycle  on  four  points  x^  •  /l5 + 6/5  »  5.33070425.. 

xl  -  -/l5  -  6/5  -  -1.25840857...,  x2  -  -xQ,  x y  -  -x^  x^^  -  (-1)\. 
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Changing  che  constant  1.0  In  the  last  example  to,  say,  1.000' ••0001 
illustrates  how  difficult  in  practice  is  the  problem  of  determining  where  a 
function  f(x)  vanishes  when  it  does  not  reverse  sign.  The  difficulty  is 
not  caused  entirely  by  roundoff.  For  example,  even  though  the  values  of 

2 

f(x)  =  (x  -  (5  -  (x  -  (5-x) )))  (Do  not  remove  the  parentheses!) 

and  its  derivative  must  be  calculated,  on  any  North  American  or  Western 
European  elctronic  computer,  precisely  (i.e.  with  no  rounding  errors)  for 
every  r  close  enough  to  3y,  none  of  those  values  of  f(x)  vanishes 
because  the  value  x  ■  3m-  ■  3.333333...  is  never  represented  precisely  in 
floating  point.  Consequently  the  Theorem’s  proviso  might  as  well  be  taken 
for  granted  in  practice. 

56.  Projective  Invariance  of  Newton's  and  the  Secant  Iteration 

Projective  transformations  are  those  which  transform  straight  lines 
into  straight  lines.  They  are  pertinent  to  Newton's  and  the  Secant  itera¬ 
tions  because  tangents  are  transformed  into  tangents,  secants  into  secants. 

The  particular  projective  transformations  useful  here  map  the  pair  {x,f(x)} 
onto  a  pair  {£,$(£.  }  in  such  a  way  tha.  either  both  or  neither  of  f'x) 
and  >(£.  are  linear  functions  of  their  respective  arguments,  and  yet  the 
mapping  is  independent  of  f  and  $.  Well-known  results  from  Projective 
Geometry  lead  to  the  following  formulas. 

Let  £  ■  o(x)  =  (oix+8)  / (yxH)  with  afi  -  Sy  "  1.  Hence  o(x)  is 
invertible;  x  •  0  V£.)  -  (8-<5£) / (y£-<i)  or,  more  symmetrically, 

(yx4^)(a-y£)  ■  1  •  (a  ♦  3/x)  (5  -  8/£) .  Having  transformed  the  variable  x 
into  £  we  further  construct 
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*(V  =  f(x)/(yx  +  &)  9  X  -  0~l(U 

-  (a- YO/r(S-  5C)/(y^-a); 

as  Che  corresponding  transform  of  f.  Now  calculation  suffices  to  verify 
that  the  transforms  of  S(x>  and  S(x,y)  are  :espectively 

n(u  =  (U  •  p^vfp'1^;;;  and 

z.'C.n;  ;  z-$(V(i:-n)/(*rzj-*rn))  -  pfsrp'Vu.o"1^;;;  , 

whence  Newton’s  Iteration  applied  to  $  takes  the  form  £  ,  ■  tffC  J,  the 

*t+l  *1 

Secant  iteration  is  . )  •  The  projective  Invariance  implied 

by  these  equations  can  be  stated  in  words  as  follows. 

Projective  Invariance:  Let  {x  be  the  sequence  of  iterates  generated 
when  either  Newton's  or  the  Secant  iteration  is  applied  to  f(x). 

Then  the  projective  transformation 

£  ■  pixj  -  < our+3 ) / ( yx+<5 )  and  ■  (a-yO/fP  with  a6  -  3>  *  1 

maps  the  iterates  {x  }  upon  the  sequence  { £  «o (x  }}  generated 

*1  K 

respectively  by  either  Newton’s  iteration  from  £q  •  pCx^J  or  the 
Secant  iteration  from  Cq  "  Ofx^J  and  ^  -  pfx^)  applied  to  $(£). 

One  application  of  projective  invariance  is  that  whatever  convergence 
Cheorv  pertains  to  the  iterations  in  finite  intervals  I  may  be  extended 
with  few  changes  to  semi- inf  ini te  intervals  pfl )  by  virtue  of  an  appro¬ 
priate  choice  for  0.  One  of  those  few  changes  is  a  nuisance  illustrated  bv 
the  example  f(x)  =  1  ♦  expf-xJ  on  the  semi-infinite  interval  0  <_  x  £  *<*>; 
both  iterations  converge  to  +•*>  but  f( 1  4  0.  Consequently  semi-infinite 
Intervals  will  not  be  mentioned  again  in  this  paper. 
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A  second  Implication  is  that  natural  hypotheses  Implying  the  iterations' 
global  convergence  should  also  be  projectively  invariant.  For  Instance, 
since  the  second  derivative 

-  (yx  ♦  6)2f"(x)  9  x  -  p“lfZJ  . 

any  hypotheses  about  the  number  of  zeros  f"  has  in  I,  or  about 
Sign  •  signf$$"),  are  invariant  provided  o(D  remains  an  interval. 

Just  such  hypotheses  aoound  ir.  the  literature  (cf.  Ostrowski  (1960  et  seq., 
ch.  9  and  10)  or  Dahlquist,  BJorck  and  Anderson  (1974,  p.  225))  and  are 
tantamount  to  the  observation  that  the  convexity  of  f' s  graph  is  a  projec¬ 
tive  invariant  implying  ultimately  steady  convergence  of  both  Iterations 
provided  they  do  not  first  escape  from  I.  More  about  this  in  §8. 

A  third  Implication  of  projective  invariance,  the  one  most  pertinent 
to  our  proof,  is  a  kind  of  mean  value  theorem  which  relates  the  two  iterat¬ 
ing  functions  S  x)  and  Stx.y)  in  a  way  more  general  than  S(xtx)  •  ,V(x) . 

Mean  Value  Le^ma:  If  S(y,z)  does  not  lie  between  y  and  z ,  (i.e.  if 

f'y/f  z)  >  0),  if  y  4  3,  and  if  f  is  differentiable  between  y 

and  c,  then  strictly  between  y  and  z  must  lie  some  t  for  which 

either  S  y.z  •  It)  or  f(t)  •  f  t)  -  0. 

See  Fig.  3 . 

Proof:  A  projective  transformation  could  be  applied  to  push  y  •  S(y,z) 
to  »  while  preserving  the  interval  between  y  and  z.  but  the  resulting 
calculations  would  boll  down  to  what  follows.  Let  ii(x)  :  /Vx)  /  (x— .•) ,  and 
observe  that  the  equation  y  «  S(y,z)  is  equivalent  to 
v(y)  ■  i>(z)  -  (f(y)~f(z))l(y~z').  Since  i>fx) ,  like  f(x) ,  is  differen¬ 
tiable  between  y  and  z,  Rolle’s  theorem  (cf.  Apoi.tol  (1967))  provides 
that  v'<t)  -  0  at  some  t  strictly  between  y  and  a.  That  t  turns 
out  to  satisfy  either  f(t)  •  f'(t)  •  0  or  v  •  N(t).  □ 


In  ocher  words,  just  as  N(x)  -  S(x,xJ  Implies  chat  Newton's  itera¬ 
tion  cannot  escape  from  an  interval  from  which  the  Secant  Iteration  cannot 
escape,  the  Mean  Value  lemma  implies  that  the  Secant  iteration  cannot  escape 
from  an  Interval  from  which  Newton's  iteration  cannot  escape  unless  / 
vanishes  in  chat  interval  without  changing  sign.  This  implication  will 
become  clearer  later. 

§7.  Inferences  from  ,V(l;  C  I 

Henceforth  we  activate  two  of  the  hypotheses  of  the  Theorem  of  §5. 

First,  I  is  a  closed  finite  interval  in  which  f(x)  is  continuous,  and 

continuously  once  differentiable  too  except  possibly  where  f  vanishes. 

Second.  .VCx)  maps  I  into  Itself.  We  do  not  yet  assume  that  x  . ,  •  S(x  ) 

K 

converges.  What  do  these  hypotheses  te  11  us  about  f  and  .V? 

Lenina :  f  has  Just  one  zero  £  in  I,  and  £  divides  I  into  at  most 
two  sub- lnterva Is  Inside  each  of  which  f(x)  is  strongly  monotonlc 
(l.e.  /'  x)  cannot  vanish  inside  either  sub- interval) . 

Proof:  At  the  outset  we  beg  the  reader  to  put  up  with  an  abuse  of 
language;  in  the  unlikely  event  that  •*  vanishes  throughout  a  non-degenerate 
sub-interval  of  I  we  shall  count  that  sub-interval  as  a  single  zero  £  of 
f  in  I  and  write  "x  ■  C"  when  we  mean  "x  belongs  to  the  interval 
This  perversion  avoids  c ircumlocut ion  in  dealings  with  functions  that  are 
not  analytic  but  merely  differentiable. 

The  essential  hypothesis  is  that  S(\)  C  I.  Now  f  cannot  have  two 
or  more  distinct  zeros  in  I  because  otherwise  Rolle's  theorem  would  supply 
between  two  adjacent  distinct  zeros  of  f  at  least  one  where 

f'  (Xq)  •  0  4  f(  Xq )  whence  *  *  would  escape  from  I.  Neither  can 

f  fall  to  vanish  in  I  since  otherwise  f  would  take  non-zero  values 
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vlch  like  signs  sc  I's  ends,  whereupon  the  Secant  Iteration  started  from 
I's  ends  would  escape  from  I  to  a  place  whither  Newton’s  iteration,  accord¬ 
ing  to  56’ a  mean  value  lemma,  could  escape  coo.  Therefore  f  does  have 
Jusc  one  zero  £  in  I.  If  f  were  not  strongly  monoconlc  strictly 
between  £  and  either  end  of  I,  I  would  contain  some  x^  where 
f'x^)  •  0  ^  f(x q)  so  again  .VfXg)  ■  *  would  escape  from  I.  □ 

Corollary:  No  x  f  £  In  I  can  separate  £  from  .VfxJ;  l.e. 

VMx)  -x)/(£-x)  >  0  for  every  x  in  I  except  x  ■  £.  And  If  Newton's 

Iteration  r  •  !t(x  )  converges  from  some  xn  In  I  It  must  con- 

verge  to  £,  the  zero  of  f  In  I. 

Proof:  VHx)  -  x)/(£-x)  •  -f(x>f ((t-x)f’(xj)  Is  continuous  and  non¬ 

vanishing  In  I  strictly  between  I's  ends  and  £,  and  therefore  conserves 
the  positive  sign  it  enjoys  at  I's  end(s)  different  from  £.  And  if 
x  s  11m  x  exists  and  If  ,V  Is  continuous  at  x  (else  x  -  £)  then 

®  91  00  00 

n-~»> 

*7x  )  -  lim  f(x  )  ■  1  1m(x  -x  ,  )f*(x  )  ■  0*f'fx  1  •  0  so  x  ■  £.  0 

'  »  *  n  *t  n+1  *  n  »  ® 

rj-eoo 

The  corollary  foreshadows  some  technical  arguments,  the  burden  of  the 
rest  of  this  section, 57,  concerning  the  continuity  of  .V (x!  at  x  •  £  where 
.**'  £7  has  not  been  assumed  to  exist.  Elsewhere  S(x)  •  X  -  fix)  If '  x)  is, 
like  fix  and  f'(x),  continuous.  But  at  x  •  £  we  shall  Infer  scarcely 
more  than  that  Six)  is  Darboux  continuous,  which  means  that  .V  assumes 
in  everv  sub-interval  around  £  all  values  between  those  V  takes  at  that 
sub- interval's  ends.  Darboux  continuity  is  an  intermediate-value  property 
possessed  not  only  by  continuous  functions  but  also,  for  Instance,  bv  deri¬ 
vatives  even  when  they  are  discontinuous.  For  more  details  see  Bruckner 


and  Ceder  (1963) . 
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The  trouble  with  S  la  not  necessarily  caused  by  our  failure  to  assume 
chat  f'  exists  and  la  continuous  at  C>  Tor  example  consider 

fix)  =  (l+x)expisina  ♦  \/x)  -Hx)  for  0  <  x  ^  1  , 

=  -expi'l/x)  for  -1  <  x  <  0  , 

=  0  at  x  -  0  . 

This  fix)  Is  infinitely  differentiable  in  the  interval  I  =  {-l£x^l}, 
vanishes  only  at  x  ■  0  and  is  elsewhere  in  I  strongly  monotonlc  increas¬ 
ing.  However 

Six)  -  x^l  -  (X^I)COS('I  ♦  l/x;)/(x‘  +x+  1  -  (l+ricosfl  ♦  1/xl)  for  0  <  x  1  , 

“  x(l-Kr)  for  -1  <  x  <  0  , 

■  0  at  x  •  0  , 

behaves  disc  one  lnuously  as  x  -  O*-.  None  the  less  we  mav  infer,  after  veri¬ 
fying  first  that  -1  £  Six)  <  x  for  0  <  x  <  1  and  second  that  x  <  Six)  <_  0 
for  -l  <_  x  <  0,  that  Newton’s  Iteration  converges  to  0  (slowly!)  from 
every  xQ  in  I. 

Proposition:  The  functions  Six),  Six  -x,  SiSixJ)  and  S(S(x))-x  are 
Darboux  continuous  everywhere  in  I  Including  at  ;,  and  the  first 
two  are  continuous  everywhere  in  7  except  possibly  at  C;  this  means 
that  each  of  these  functions  assumes  in  every  sub-interval  of  I  all 
values  between  the  ones  taken  at  chat  sub-interval’s  ends. 

Proof:  The  proposition  will  be  proved  for  Six)  first,  for  which  we 

need  only  be  concerned  with  sub-intervals  of  I  that  contain  C.  And  If 

such  a  sub-interval  is  divided  at  £  into  two  parts  for  each  of  which  the 
proposition  is  verified  separately,  the  proposition  will  be  verified  for 
the  whole  sub- interval . 
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For  definiteness  choose  any  n  <  Z  In  7  and  let  us  verify  che  propo¬ 
sition  for  Che  closure  of  Che  unclosed  interval  J  =  {n^x<c).  Since 
X(x)  is  continuous  in  J  its  image  X(J)  is  also  an  Interval,  and  to  verify 
the  proposition  we  need  only  show  chat  its  closure  contains  Z  ■  H(^).  Now 
there  are  only  two  cases  to  rule  out,  namely  C<  closuref.Vf and 
Z  >  cloaureiX(J) ) . 

If  for  some  e  >  0  we  found  Z  <  Z * 2c  <  X(x)  throughout  J,  i.e. 
for  n  <  x  <  ;,  we  should  have  to  find  that 

1>(xJ  =  lf(x)/f(r\)  -  /(c+€-n) 

takes  values  of  opposite  sign  at  J's  ends; 

-  l  >  o  >  -e/(c+€-n)  •  4>fzJ  , 

so  ii  must  first  change  sign  somewhere  in  J,  say  at  £  where 
v(i)  ■  0  >_  ’4>f  .  But  this  would  imply  2f(£)/f(  r\)  ■  (;+€-£)  /(C+€-n)  >  0 

and  -2f(Vlf(r\i  >  l/(C+c-n)  >  0  whence  !t(V  -  f.  ♦  [f(V  /'fnJ )/(-/'  (V /f(n)) 
^  £♦(£+€-£)  ■  Z  *■  C  <  £  +  2e!  Therefore  Z  f  closure (! V( J))  . 

On  the  other  hand,  if  for  some  c  >  0  we  found  V(x)  <  Z  ~  2e  <  Z  for 
all  n  £  x  <  ;  we  should  have  to  violate  the  foregoing  corollary's 
Inequality  0  <  {X(x)  -  x)  /  ( I~x)  at  x  ■  max{ £-e ,n) .  Therefore 
Z  ^  closure.  N(J) ) . 

Thus  we  conclude  that  X(x)  is  Darboux  continuous.  Consequently 
X(ltfxJ)  is  Darboux  continuous  Coo.  Moreover,  we  shall  now  find  that  S(x)% 
and  Chen  X(X(x) ) ,  belong  to  Che  first  Baire  class  of  functions,  the  point- 
wise  limits  of  continuous  functions.  This  is  evident  because  we  can  approxi¬ 
mate  X(x)  by  a  continuous  function  differing  from  .V  only  in  an  open 
deleted  neighborhood  of  Z,  though  both  functions  match  at  Z  and  at  the 


boundaries  of  Che  neighborhood,  and  Chen  lec  Che  neighborhood  shrink  down 
onco  £.  The  flrsc  Baire  class  is  significant  because  Che  sum  of  a  Darboux 
continuous  function  in  chac  class  wlch  a  continuous  function  is  known  Co  be 
another  Darboux  continuous  function  in  that  class;  see  Theorem  7.5  of 
Bruckner  and  Ceder  (1965,  p.  109).  Therefore  S(x)~x  and  N(N(x))-x 
are  Darboux  continuous  and  the  proposition  is  proved.  □ 

This  proposition  is  crucial;  it  allows  Part  I's  No  Swap  Theorem  to  be 

generalized  enough  to  cover  Vix).  One  contrary  implication  of  the  No  Swap 

condition  Chat  can  be  obtained  almost  inmiediacely  is  the  following,  which 

shows  that  the  last  example  above  is  typical  of  Che  kind  of  discontinuity 

chat  can  befall  V  without  precluding  convergence.  As  is  customary,  we 

write  x  ♦  to  mean  that  x  Increases  to  the  limit  C.  and  x  ♦ 

when  r  decreases  to  C.  Can  .V  be  discontinuous  both  as  x  ♦  Z*  and  a* 

•  ^  *» 

^  a  • 

Aside:  If  X(x)  swaps  no  two  distinct  points  in  7,  S(x)  cannot  be  dis¬ 
continuous  on  both  sides  of  £  but  at  most  one;  and  as  x  *  Z  from 
Che  side  opposite  the  discontinuity  N(x)  must  ultimately  lie  between 
Z  and  x. 

Specifically,  if  ,V(x)  -f*  C  a*  "  *  Z~  set  n  =  11m  Sup  X(x)  >  £; 

x-*  C- 

then  Z  £  H(x)  <  x  whenever  Q  <  x  £  n. 

Proof:  Suppose  on  the  contrary  that  11(v)  <  Z  and  Z  <  v  <  n.  Since 
.V  is  Darboux  continuous  at  Z  •  ,V  must,  for  any  y  <  Z  in  I, 

assume  all  values  between  £  and  n  as  x  runs  from  y  up  to  Cl  there¬ 
fore  S(u)  •  V  for  at  least  one  u  in  y  <  u  <  Z>  Therefore  X(N(u)',  -u 
•  S('J)  -u  <  .V(y)  -  y  <  0  provided  y  be  first  selected  in  S(v)  <  y  <  £; 
on  the  other  hand  !t(S(x))-x  >  0  for  x  close  enough  to  I's  left-hand 
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end  since  S(l)  C  I.  Consequently,  because  S(Six) )  -  x  Is  Darboux  continuous 
too,  SiSix^))  -xQ  •  0  for  at  least  one  xQ  <  u  <  £  in  I,  and  then 
x^  •  .V(Xg7  >  xo  ^ecause  °f  the  Corollary  above.  So  .V  would  swap  the 
distinct  points  x^  and  x^  contrary  to  hypothesis.  □ 

58.  The  No  Swap  Theorem  for  Newton's  Iteration 

Theorem:  Suppose  I  is  a  closed  finite  interval  in  which  f(x)  is  continuous, 

and  continuously  once  differentiable  too  except  possibly  where  f 

vanishes.  Let  Six)  =  x  -  fix) / f' lx)  except  Six)  I  x  when  fix)  ■  0. 

Then  Newton's  iteration  x  ,  “  Six  )  generates  from  everv  x.  in  I 

n-fl  t  *  0 

a  sequence  {x  }  of  which  some  subsequence  converges  to  a  zero  £  of 
n 

f,  l.e.  a  subsequence  of  {fix  7}  converges  to  zero,  if  and  only  if 
Si l)  C  I  (and  therefore  £  is  unique)  and  V  satisfies  any  of  the 
following  conditions  (they  are  equivalent). 

The  No  Swap  Condition:  if  x  -  SiSixJ)  in  I  then  x  -  Six)  («  £). 

The  No  Separation  Condition:  Either  S(Six))  <_  z  <_  Sis)  or  Six)  <_  z  <  SiS(x)) 

In  I  implies  SiSiz!)  ■  a  •  Sis)  (■  £) . 

The  No  Crossover  Condition:  If  S(v)  <  a  <  »  <  Si u)  in  I  then 

Siv)  -  u  ■  y  •  Siu)  (-  £). 

The  One-Sided  Condition:  Whenever  x^  •  S(x^)  4  x^  in  I  all  subsequent 

Iterates  x  •  .VCx  )  also  differ  from  x„  and  lie  on  the  same  side 
n+1  n  0 

of  Xq  as  does  x^  (and  £)  . 

If  also  S  is  continuous,  or  if  also  either 

.VClIm  sup  Six))  i  ;  or  .vrllm  Inf  Six))  *  £ 
x  *  £-  X  *  £♦ 

then  any  of  the  foregoing  conditions  implies  that  "  .V(x^7  *  C 

from  every  x^  in  I. 
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Proof:  Only  when  N  Is  not  continuous  need  the  proof  involve  a  little 
more  work  than  merely  writing  N( •-•)  in  place  of  $(•••)  in  part  I. 

Having  established  in  §7  that  N(x) ,  S(x)-x,  S(K(x))  and  N(N(x) )  -  x 

are  Darboux  continuous,  we  may  infer  as  before  that  certain  expressions 
must  vanish  somewhere  between  any  two  points  at  which  they  change  sign,  so 
most  of  Part  I's  arguments  will  not  need  revision.  But  arguments  that  for¬ 
merly  depended  upon  lim  $(x)  ■  im  x)  must  be  revised  lest  S(x) 
as  x  -  " .  The  first  such  revision  is  needed  to  validate  the  No  Separation 
condition;  please  turn  back  to  and  re-read  §2  in  conjunction  with  what 
follows . 

To  show  that  No  Swap  implies  No  Separation  assume  the  latter  condition 
violated  by,  say,  .Vi 'N(z))  <  z  <  ,V<'«7  in  I  and  seek  a  consequent  violator 
V  of  the  former.  As  before,  .V  must  have  a  fixed  point  y  strictly 
between  z  and  .V (z),  but  this  time  S(\)  C  I  implies  y  •  £  is  the 
unique  fixed  point  of  .V,  the  zero  of  f  in  I  (cf.  §7).  Also  as  before 
S(Slx))  has  a  fixed  point  V  <  z  In  I,  but  this  time  we  don’t  care 
whether  9  is  a  "first"  fixed  point  or  not  because  y  <  z  <  £  so  y 
cannot  be  «V’s  unique  fixed  point  C.  Therefore  .V  swaps  y  and 
S(v)  *  y. 

The  No  Crossover  and  One-Sided  conditions'  proofs  survive  unchanged. 

The  next  revision  is  needed  to  adapt  54 's  proof  that  One-Sidedness  follow¬ 
ing  from  the  No  Swap  condition  implies  convergence,  because  now  that  impli¬ 
cation  is  Invalid. 

As  before,  the  sequence  of  Iterates  x  ,  •  S(x  )  is  a  One-Sided 

n+1  n 

sequence  which  is  either  ultimately  monotonic  and  therefore  convergent  to 
C  (cf.  S7's  Corollary)  or  else  can  be  partitioned  into  two  disjoint  infi¬ 
nite  subsequences,  one  ascending  to  a  limit  x  and  the  other  descending  to 
a  limit  x  >  x.  It  is  not  possible  for  both  limits  to  differ  from  £ 
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because  Chen  iV  would  be  continuous  ac  both  limits  and  §4' 3  argument  would 
make  them  both  equal  £.  Therefore  at  lease  one  of  x  and  x  equals 
as  was  claimed  above. 

Moreover,  suppose  for  definiteness  that  x  >  x  ■  C;  conclude  the 

proof  by  showing  that  n  =  1  im  sup  S(x)  >_  x  >  Z  m  .V(n).  Since  some  of  those 

x  -  C- 

ascendlng  x  *  x  ■  C  are  followed  bv  x  ,  •  S(x  )  •*•  x+  >  Z  we  must  find 
*1  *!+l  *1 

C  <  x  <  n.  Moreover,  since  .V  is  continuous  at  x,  all  chose  descending 
x  -*  x+  close  enough  to  x  comply  with  Che  No-Man's  Land  lemma  (53)  by 
having  x^+^  -  .Y(x^)  ■*  so  N(x)  ■  C.  But  if  any  such  x^  lay  in 

x  <  x  <  n  we  could  infer  from  §7's  last  Aside  that  ;  <  x  ■  .Vfr  )  <  x 

contrary  to  the  last  sentence;  therefore  no  descending  x  lies  in 

x  <  X"  n  and  hence  x  •  n  and  ,V(n)  “  C  as  claimed. 

Here  is  an  example  to  show  that  the  phenomenon  x  <  x  analyzed  in  the 

last  paragraph  is  possible  albeit  unlikely.  Let  I  •  {-l£x<3}  and  therein 


f(x)  5  ( Jx-2)  ‘ 


for  1  <  x  <  3  , 


for  0  <  x  <  1  , 


;  —  *  1  — x  exp^sirv-l/x;  ♦  l/x)  for  -1  <  x  <  0  . 


This  f'x)  is  once  continuously  differentiable  (it  could  be  modified  on 
0  <  x  <  2  to  be  made  Infinitely  differentiable  and  substantially  more  com¬ 
plicated).  it  vanishes  only  at  ;  :  0  in  I  and  is  elsewhere  strongly  mono¬ 


tonic.  .V.'x)  -  x  -  f(x)/f  (x)  maps  I  into  I  and  swaps  no  two  distinct 

points,  and  is  continuous  too  except  as  x  *  £-.  If  Newton's  iteration 

x  -  V  x  )  is  started  at  xn  •  -l/(2ir)  then  x,  -  -1/(2**^)  •*  C-  but 
n+1  i  0  2  n 

x,  . .  •  2  -  x,  *  2+.  □ 

2n+l  2n 
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How  may  a  function  f  eligible  for  the  application  of  the  No  Swap 
theorem  be  recognized  in  practice?  One  way  is  to  apply  the  contrapositive 
of  the  next  lemma  which  exhibits  some  of  the  properties  possessed  by  f 
when  .V  violates  the  No  Swap  condition.  Any  f  which  lacks  those  proper¬ 
ties  satisfies  that  condition.  Fig.  9  illustrates  these  properties. 

Lenina :  If  Six)  ■  x-flx)lf'(x)  swaps  the  ends  u  and  v  of  an  interval 

J  in  which  f  is  twice  differentiable,  then  one  of  the  following 

situations  must  arise. 

1.  f(u)flv)  >  0,  and  then  f'lu)f'(v)  <  0  so  f  must  reverse  sign 
somewhere  inside  J,  and  f"  must  take  the  same  sign  as  flu) 
somewhere  in  J;  if  also  f  vanishes  somewhere  in  J  then  ff" 
must  take  negative  values  in  two  open  sub-intervals  of  J  between 
which  f"  reverses  sign  at  least  twice. 

2.  flu)f(v)  <  0  so  f  reverses  sign  at  least  once  inside  J,  and 
then  ff"  takes  negative  values  at  places  inside  J  where  f 
takes  both  positive  and  negative  values:  therefore  f”  reverses 
sign  at  least  once,  and  at  least  once  more  if  f  ever  vanishes 

in  J. 

Proof:  This  is  a  tedious  exercize  in  curve-tracing  whose  object  is  to 
describe  the  ups  and  downs  of  f  in  J.  For  definiteness  assume  u  <  U 
and  f( V)  >  0,  and  re-write  u  -  X(v)  and  y  *  VfuJ  >  u  in  the  forms 

f'(v)  •  y-u)  >  0  and  f’(u)  ■  f(u)/(u-v)  . 

Case  1:  flu)  >  0.  Now  flu)  <  0  <  f  ' v )  and  consequently  f"  must 

take  positive  values  somewhere  between  u  and  V.  If  also  at  some  n 

between  u  and  V  we  find  f(r\ )  -  min  fix)  £  0,  then  f'(i\)  *  0  and 

u<x<v 

f”  >  0  somewhere  in  that  neighborhood  of  n  wherein  f"  >0;  we  find 
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further  that,  as  x  increases  from  u  to  n  to  v,  fix)  moves  from 
flu)  <  0  through  some  lesser  value  (.f(r\) -flu)  )  /  (n-u)  <_  -f  (u)  /  (n-u) 

<  -f(u)/\'>- u)  ■  flu)  and  then  moves  to  a  higher  value  f  (r\)  •  0  and  on 
up  through  (flv)-flr\))/(-j-n)  >f(v)/(v- n)  >  f(v)!{-j-u)  -  flv)  and  back 
down  to  flv).  Hence  f  has  a  negative  minimum  and  positive  maximum 
inside  J,  which  means  chat  /"  reverses  sign  at  least  twice  in  J .  As  r 
increases  from  u  to  y,  fix)  has  to  decrease  before  fix)  can  vanish, 

and  therefore  ff"  <  0  in  some  sub-interval  before  f  vanishes;  similarly 

ff”  <  0  somewhere  between  the  last  zero  of  f  and  v. 

Case  2:  flu)  <  0.  Now  f(Q)  “  0  at  some  first  Z  between  u  and  V. 

As  x  increases  from  u  to  Z,  fix)  moves  from  a  positive  value  flu) 
through  a  larger  value  (f(Z)-flu))/(Z~u)  >  -flu)/(v-u)  -  flu),  so  fix)  >  0 
somewhere  between  u  and  Z.  Similarly  f  <  0  somewhere  between  f s 

last  zero  and  V.  More  precisely,  f  has  a  positive  maximum  inside  J  at 
which  f”  reverses  sign.  If  f  ever  vanishes  inside  J  then  f  has  a 
non-positive  minimum  at  which  again  f  reverses  sign  at  least  once  more. 

□ 

The  next  coroliarv  is  an  advance  beyond  what  was  previously  known 
because  It  deals  with  functions  f  whose  graphs  are  not  convex  but  may  have 
at  most  one  or  two  Inflexions.  As  long  as  Newton's  Iteration  cannot  escape 
from  I  via  the  ends  or  the  inflexion  points  of  f s  graph,  the  outcome 
turns  out  to  be  the  same  as  if  that  graph  were  convex  (cf.  Dahlquist,  Bjorck 
and  Anderson,  1974,  p.  125). 

Corollary:  Suppose  V  and  s  are  the  ends  of  a  closed  (possibly  infinite) 
inrerval  l  in  which  f  is  twice  differentiable,  f  never  vanishes 
except  possiblv  if  and  where  f  vanishes,  f"  reverses  sign  at  most 
once  except  possibly  again  if  and  where  both  f  and  f  vanish 
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simultaneously,  and  at  least  one  of  f(v)f(z),  or  f(z)f"(z) 

Is  positive.  Suppose  too  that  S(x)  =  x  -  f(x) If'  (x)  maps  into  I's 

Interior  both  I’s  ends  and  the  places  if  any  where  f"  reverses  sign. 

f"  may  vanish  arbitrarily  often  without  changing  sign.  Then  f  has 

in  I  Just  one  zero  £  and  Newton's  Iteration  x  ,  ■  H(x  )  converges 

>1+1  n 

to  it  from  every  x^  in  I. 

Proof:  An  argument  similar  to  §7's  lemma  shows  that  f  must  have  just 
one  zero  £  in  I;  more  would  violate  a  hypothesis  by  providing  some  x 
in  I  at  which  /'  x)  ■  0  i  f  xJ  ,  and  fewer  would  either  do  the  same  or 
violate  a  different  hypothesis  by  forcing  V  to  map  at  least  one  of  I's 
ends  outside  I. 

A  second  argument  proves  V  continuous  in  I;  this  is  obvious  from 
the  formula  for  V  when  f  +  0,  so  only  the  possibility  /'f£j  •  0 
needs  further  explanation.  When  X  is  confined  to  a  small  neighborhood 
of  £  in  which  /"  never  reverses  sign  except  possibly  at  £,  fix) 
must  be  monotonic  separately  on  each  side  of  £  and  consequently 
0  <  f ’  v.  If  x'  <  1  for  everv  ,  strictly  between  £  and  r  in  that  small 
neighborhood.  Since  f  x  •  f  j  d- . 


consequently  .Y<r  always  lies  between  £  and  x  in  that  small  neighbor¬ 
hood.  Therefore  It  is  continuous  at  £  •  .V  £  and  hence  throughout  I 
even  when  f  (Z)  •  0. 

The  next  task  is  to  infer  ,7/ 1  C  J.  Were  this  not  so  despite  that  .V 
maps  I’s  ends  Inside  I,  .V  would  have  to  achieve  either  a  maximum  or  a 
minimum  value  ,V  x  outside  I  at  some  x  Inside  I,  and  that  x  ^  £ 
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because  V  ;  ■  Q  .lea  lnsld«  I.  Moreover,  .V'  would  have  Co  reverse 

sign  at  x  or  at  the  ends  of  that  sub-interval  of  I  containing  x  on 
which  V  ■  V  x) .  But  S'  •  ff"  f (f ) ~  Is  prevented  by  hypothesis  from 

reversing  Its  sign  in  I  except  possibly  at  C  and  at  most  one  other  place 

also,  by  hypothesis,  carried  by  .V  Into  I’s  Interior.  Therefore  S(x) 
lies  Inside  I  after  all,  and  hence  S(l)  C  I. 

Sow  let  us  verify  that  V  satisfies  the  No  Swap  condition.  If  not,  .V 

would  swap  the  ends  of  some  sub-interval  J  and  the  leama  above  would  imply 

that  f"  changes  sign  at  least  once  Inside  J,  at  least  twice  inside  J  if 
f  '  *0.  and  at  least  once  outside  J  between  Its  end  and  that  end  of  I 
where  ff"  >  0.  Thus  f"  would  vanish  more  often  than  allowed  by  the 
hypotheses.  Therefore  .V  does  satisfy  the  No  Swap  condition,  and  Newton's 
iteration  does  converge  to  □ 


AddI  ication  1:  Suppose  j'x  and  h(xJ  are  thrice  differentiable  for  all 
x  ^  0  and  •  h(0 )  ■  0  but  j'  >  0,  g"  0,  g"'  >0,  h '  >  0, 

h"  >  0  and  h"'  >_  0  for  all  x  *  0;  and  let  f ( x)  5  g<x)  -xhd/x)  be 
the  function  whose  zero  '  is  sought.  Such  a  computation  Is  encountered 
In  certain  financial  transactions  In  which  X  ■  1  * t  is  related  to  the 
Interest  rate  i ,  jix)  represents  the  present  value  of  various  past 
Investments,  xh  \!x)  represents  the  present  value  of  anticipated  returns 
from  those  Investments,  and  f  x)  Is  the  net  present  value  of  the  trans¬ 
action.  where  f(t,)  •  0,  determines  the  putative  rate  of  return  on 

money  invested  In  the  transaction. 

Newton's  Iteration  can  be  shown  to  converge  to  /'a  sole  positive  zero 
;  from  every  positive  starting  Iterate  x^  by  invoking  the  corollary  above. 
Note  that  for  all  x  >  0 


(xhd/x  )* 


h(lfx)  -  h'(l/x)/x 


;l/x 

{h'fui-h'f i/x))dv  <  0 

'0 


whence  f  »  0.  And 


( X )  -  g"'  ( x )  +  lx  h"(l/x)  +x  h"'  C l/x)  >  0  so 
can  reverse  sign  at  nose  once  for  x  >  0.  Moreover,  for  all  x  >  0 

six)  =  x-f(xHf*(x)  -  (x^'fx^ -^fx;  ♦n,fi/x;)/(<7,fx;  -  uha/xn  •)  >o 

rx 

because  xg '  x^  -g,x)  •  (j'  X>~g'  J>) du  >  0;  therefore  ,V  naps  the  posi- 

J0 

tive  real  axis  to  itself.  Finally  observe  that  xhl'l/xJ  —  h'(0)  >0  as 
x  »  ■*->,  while  lim  ( xhll/x ))  -  Tim  h(y'/y  -  (either  ♦**>  or  >  0, 

and  consequently  f  x  and  x  -  V  x  must  take  negative  values  in  the  neigh 
borhood  of  x  ■  Oe,  positive  values  in  the  neighborhood  of  x  ■  and 

since  >  0  either  f”  r  0  for  all  x  >  0  or  else  ff”  must  take 

positive  values  in  at  least  one  of  those  neighborhoods.  Therefore  f 
satisfies  the  coroilarv’s  conditions  in  some  closed  sub-interval  I  of  the 
positive  real  axis.  □ 


Application  2:  Suppose  £  and  n  are  two  consecutive  distinct  aeros  of 

f  between  which  f  is  twice  differentiable,  and  suppose  f( £J  f7 r\)  <  0. 

Then  strictly  between  £  and  n  lies  Just  one  zero  Z  of  f ,  and  £’s 

catchment  basin  for  Newton’s  iteration  x  ,  •  x  -  x  '  /*'  x  includes 

n*i  n  n  n 

in  its  interior  at  least  one  place,  also  strictlv  between  £  and  h,  where 
f”  reverses  sign. 


Proof:  The  term  "catchment  basin"  was  defined  near  the  middle  of  54. 

Let  V  x)  r  x  -  fix  /fix)’,  it  is  continuous  at  Z  because  flZ)  t  0* 
Therefore  by  restricting  x  and  y  to  a  sufficiently  small  neighborhood 

around  ;  we  may  ensure  that  l-f  y)/f(x)  is  as  small  as  we  please. 

t* 


Consequently  (V  x;-£)/(x-L)  * 

small  as  we  please  for  all  x 


j  (1  -f  y  /f  X))4y/(X-Z)  mav  be  made  as 
close  enough  to  C.  and  hence  C  is  a 


strongly  attractive  fixed  point  of  V,  which  has  no  other  fixed  point 


between  ;  and  n.  By  Invoking  §4's  lemma,  or  otherwise,  we  deduce  that 
the  ends  of  C ' *  catchment  basin  lie  strictly  between  £  and  n,  straddle 
i ,  and  are  swapped  by  .V.  The  lens&a  above  implies  now  that  /"  reverses 
sign  at  least  once  inside  the  catchment  basin  as  claimed.  □ 

This  result  is  significant  because  it  permits  all  the  real  zeros  of  a 

function  f  in  any  Interval  to  be  calculated  quickly  via  Newton's  iteration 

provided  first  all  the  zeros  of  two  consecutive  derivatives  and 

(n  >  1)  in  that  interval  are  known.  Having  straddled  a  zero  of  f 

with  two  consecutive  (approximate)  zeros  of  compute  ^  at  the 

enclosed  zero(s)  of  to  str4ddle  the  desired  zero  of  ^  more 

closely,  and  then  start  Newton's  iteration  from  one  of  the  straddling  zeros 

of  The  Iteration  (not  Just  a  subsequence  of  it)  must  converge  to 

„(n-l) 

the  straddled  zero  of  ,  and  must  do  so  One-Sidedly  and  rapidly  (faster 

than  any  geometric  progression) ,  when  started  from  the  right  one  of  those 
straddling  zeros  of  .  The  right  one  can  be  distinguished  from  the 

wrong  one  when  iterates  started  from  the  wrong  one  straddle  the  right  one. 

The  rapidity  of  convergence,  the  fact  that  the  precision  of  the  computation 
need  not  much  exceed  whatever  is  required  to  separate  adjacent  zeros  (multiple 
zeros  announce  themselves  first  as  simple  zeros  of  a  higher  derivative),  and 
the  freedom  from  Sturm  sequences  are  three  reasons  for  considering  the  fore¬ 
going  vaguely  described  algorithm  as  a  potential  replacement  for  others  that 
have  appeared  elsewhere;  cf.  Helndel  (1971),  Collins  (1974)  and  Verbaeten 


(1975). 


§9.  The  Secant  Iteration 


Theorem:  Suppose  I  Is  a  closed  finite  interval  in  which  f  is  continuous 
and  continuously  once  differentiable  too  except  possibly  where  f 
vanishes  in  I.  Suppose  too  that  Six),  defined  by 


S(x )  =  x  -  f(x)  If*  (x)  except  S(x)  z  x  when  f(x)  •  0  , 


naps  I  into  Itself  and  satisfies  the  So  Swap  condition  or  one  of  its 
equivalents  (see  $8),  as  oust  be  the  case  if  Newton's  iteration 
x^^  •  .V  x^J  converges  in  I  from  every  x^  in  7.  Finally  suppose 
f  reverses  sign  across  its  (necessarily  unique)  zero  (  in  I.  Then 
the  Secant  iteration  x  •  Six  ,x  ,7,  where 


Slx.y)  =  Sly.xi 
z  Six) 


x  -  flx){x~y)l[f(x)~f(y)]  if  y  i  x  and  fix)  f  0 

if  y  -  x  or  fix)  ■  0 


generates  from  every  x^  and  x^  in  I  a  sequence  {x^}  of  which 

some  subsequence  converges  to  C;  i.e.  a  subsequence  of  { fix  ) }  con- 

verges  to  0.  If  also  .V  is  continuous  then  x  •* 

n 

Proof:  Our  strategy  is  to  ldentlfv  certain  subsequences  of  {x  } 
which  converge  monotonlcally  to  The  Mean  Value  leans  of  $6  and  the  pro¬ 

perties  of  V  and  f  exposed  in  §7  and  §8  will  be  exploited  heavily.  For 
limtance.  57' s  lemma  implies  that  in  I  f  is  monotonlc,  has  Just  one  zero 
and  is  strongly  monotonlc  except  possibly  at  l  where  f*  may  vanish 
or  fall  to  exist  but  cannot  reverse  sign.  Moreover  Six)  is  continuous  in 
I  except  possibly  at  x  •  £,  and  Slx,y)  la  continuous  in  I  «  I  except 
possibly  at  x  •  y  ■  These  Inferences  are  but  the  first  of  a  long  chain 
which  has  been  organized  into  a  list  of  propositions  numbered  for  easier 
reference.  Some  of  the  propositions,  like  the  first,  have  a  proof  so 
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straight  forward  that  It  is  omitted. 


Proposition  1:  x  •  S(x,y)  for  some  x  and  y  in  I  if  and  only  if 

x  ■  consequently  the  Secant  iteration  x  ■  S(x  ,x  ,)  has  everv 

*1+1  *1—1 

x. . .  -  x  i  0  for  n  >  1  unless  x  •  Z. 

*i+l  n  —  *i 

To  avoid  trivial  nuisances  we  assume  henceforth  in  the  theorem's  proof 
that  all  x  P  Z. 

Ft 


Proposition  2:  If  x  and  x  ,  both  lie  in  I  so  does  x  ,  ■  S'x  ,x 

*1  *1-1  FT-fl  Ft’  Ft-1 

and  therefore  all  Secant  Iterates  x  lie  in  I. 

Ft 

Proof:  Recall  S(l)  C  I  and  Invoke  §6's  Mean  Value  lemma  after  observing 

that  if  ;  lies  between  x  and  x  ,  so  must  x  ,  since  f(x  )f(x  .)<  0. 

Proposition  3:  If  some  subsequence  of  (x^  -x^)  converges  to  zero  then 
the  corresponding  subsequences  of  {x^}  and  {x^+^}  converge  to 

Proof:  Since  S(x,y)-x  Is  continuous  throughout  the  compact  square 
I  «  I  except  possibly  at  the  one  point  x  •  y  •  C.  and  since  S(x,y)  -x  •  0 


only  on  the  line  x  ■  £  according  to  proposition  1,  x^^  -  x^ 


Sfx  ,x  .< 

*t  *t-l 


-  0  Implies  x  -  Z  *  0  and  hence  x  , 
»t  *t+l 


as  claimed. 


Definitions:  An  iterate  x  ■  J  x  , ,x  is  called  a  Variance  whenever 

n  »i-l  *1-2 

f  x  %)/f!x  )  <  0,  and  then  x  ■  S(x  ,x  ,)  and  Z  *uet  both  lie 

*1-1  *1  1*1  *!  *1-1 

strlctlv  between  x  and  x 

n  *1-1 

An  Iterate  x  ■  S>x  , ,x  Is  called  a  Permanence  whenever 

*i  *1-1  *i-2 

f(x  ,)/f(x  )  >  1,  and  then  x  ■  S(x  ,x  .)  and  Z  oust  both  lie 

strictly  on  the  same  side  of  both  x  and  x  , . 

*1  1 


Proposition  4:  Everv  Iterate  x  •  Sfx  ,  ,x  .)  with  *i  >  2  is  either  a 

7  n  *i-l  *t-2  — 

Permanence  or  a  Variance. 


See  Fig.  10. 
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Proof:  Only  Che  proof  chat  when  ►:  >  2  and  f(x  ,)!f(x  )  $  0  then 

n-1  n 

'  >  *  r*quir«*  as  much  effort  as  Invoking  the  monotonicity  of 

f  in  two  cases.  f(xn_2)/f(xn-l)  <  0  and  I'(xk-2)  >  °‘ 

Proposition  5:  If  two  consecutive  Iterates  x  -  S(x  , ,x  ,1  and 

n  *t-l  n-2 

xn4.l  ■  Sfx^tX^^)  are  both  Variances,  then  x^+1  lies  strictly 

between  x^_^  and  x^,  and  x^+.,  and  Z  both  lie  strictly  between 

x  .  and  x  ,  and 
n+1  *? 


‘Wl^-WW  "  ‘  ' 

'et  Fig.  11. 

Proof:  Only  the  last  Inequality  Is  an  unobvious  Inference.  By  hypo¬ 


thesis  both  '  x^_^)/fx^)  <  0  and  /(x^J /ffx^^)  <  0;  and  because  xn+^ 

lies  between  x  ,  and  and  f  Is  stronglv  aonotonic,  f(x  ,)!f(x  ,)  >  1. 

n-i  n-1  n+1 

Therefore 


Proposition  6:  If  the  Secant  Iterates  x  are  ultimately  (l.e.  for  all 

PI 

sufficiently  large  n)  all  Permanences,  or  ultimately  all  Variances, 
they  converge  to  £. 


P^oof:  If  ultimately  all  x^  are  Permanences  they  form  a  sequence 
which  is  ultimately  aonotonic  and  bounded  (by  C)  in  I;  therefore  the 
sequence  converges  and.  by  proposition  3,  converges  to  C.  If  ultimately 


62 


all  x  are  Variances  then  Che  subsequences  {x.  }  ar.d 

n  2n 

matelv  monotonic  and  convergent;  moreover  |x„  ,  -  x_  ! 

2n+l  2n‘ 

fast  as  some  multiple  of  4  n  because  of  proposition  5, 
claimed . 


{x,  }  are  ultl- 

2n+l 

0  at  least  as 
so  x  •»  £  ®s 


Permanences  are  best  thought  of  as  punctuation  marks  separating  strings 
of  consecutive  Variances,  and  the  only  significant  property  of  each  string 
Is  whether  Its  length  Is  even  or  odd.  The  significance  of  even  length  Is 
suggested  by  the  next  proposition,  whose  straightforward  proof  is  omitted. 


Proposition  7;  If  a  Permanence  x  Is  followed  bv  an  even  number  2k  >  0 

of  consecutive  Variances  x  , ,x  ,,..., x  before  the  next  Permanence 

h+1  n+2  n+2Jc 

*  ^->1.^1  •  then  the  numbers  x  ,  ,x  ,x  . ,  ,x  .  , . . .  ,x  ,x 
n+2*+l  n-1  n  n+2  n+4  n+2k  n+2k+l 

x  . . x  ,,x  .  are  exhibited  here  In  strictlv  monotonic  order. 

n+2rt-l  n+3  ns  l 

(If  2*c  •  0  or  2  delete  the  appropriate  right -hand-most  x's.) 

See  Fig.  12. 

If  at  moat  finitely  many  strings  of  Variances  have  odd  length  the 


convergence  properties  of  the  sequence  {x  }  are  relatlvelv  transparent, 
as  the  next  proposition  shows. 


Proposition  8:  If  ultimately  no  two  Permanences  are  separated  by  an  odd 
number  of  Variances  then  the  Permanences  converge  to  £;  If  also  .V 
Is  continuous  at  £  the  Variances  converge  to  £  too. 

Proof:  By  re-numberlng  the  Iterates  to  discard  some  early  ones  If 

necessary,  we  may  assume  no  two  Permanences  are  separated  by  an  odd  number 

of  Variances,  In  which  case  the  previous  proposition  Implies  that  the 

Permanences  and  their  antecedent  Iterates  constitute  a  monotonlc  bounded 

(by  £)  subsequence  of  the  Iterates.  In  other  words,  if  the  successive 

Permanences  are  x  ,x  ,x  ....  Chen  Che  numbers 
"l  n2  ”3 
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1»x  * <■  _  ,  >X  «X  .  |J 

\  V1  n2  V1 


are  displayed  here  in  strictly  monotonlc  order.  Obviously  this  subsequence 

of  x's  must  converge  and,  by  proposition  3,  it  must  converge  to  C.  Before 

we  find  out  what  happens  to  the  rest  of  the  iterates  x^  let  us  invoke  §6's 

Mean  Value  lemma  to  define  y  as  a  solution  of  x  . ,  •  N(y  )  between 

n  .  i  .+1  ~  n  . 

J  J  J 

x  ,  and  the  Permanence  x  for  J  •  1,2,3 .  Evidently  y  —  £  too. 

n.-l  n.  *  •  *  Jn . 

J  V  |/ 

If  .Y  is  continuous  at  £  we  mav  infer  first  that  x  ,  ■  Yfv  7  ♦  Y(£7  ■  £ 

n  .+1  “ n  . 

V  J 

and  then,  from  proposition  7,  that  all  x  -*  £  as  n  *  <®.  If  .V  is  discon- 

tinuous  at  £  there  is  some  risk  that  x  .  -  S(y  )  4*  .Vf£j;  this  sltua- 

n„.+l  "n .  ' 

V 

tlon  arises  with  examples  f  one  of  which  is  exhibited  at  the  proof's  end. 

To  complete  the  theorem's  proof  we  need  only  deal  with  the  possibility 

that  infinitely  many  pairs  of  consecutive  Permanences  are  separated  by  odd 

numbers  of  Variances.  This  possibility  is  awkward  only  because  the  notation 

required  to  deal  with  it  is  complicated. 

Let  us  Invoke  §6's  Mean  Value  lemma  again  to  define  for  every  Permanence 

x  the  solution  y  of  Yf'v  )  -  x  ,  between  x  and  x  ,  and  closest 
k  "n  n  n+l  n  n-1 

to  x^.  These  solutions  y^  were  useful  already  during  the  previous  propo¬ 
sition's  proof,  but  thev  are  crucial  below  because  thev  provide  the  sole 

entree  for  V's  No-Swap  condition.  Note  that  w  is  so  far  defined  not  at 

~  n 

everv  n  but  onlv  at  those  k  for  which  x  is  a  Permanence. 

Next  define  a  Scout  to  be  a  Permanence  x  followed  bv  an  odd  number 

n 

of  Variances,  say  xn4>1  •xn+2,in+3 . Xn*2k  +1  •  and  then  *nother  P*rmanence 

n 

xr+’’k  *2'  T*lat  ^asC  Per!Mnen<:e  »ight  a  Scout  too  or  it  might  not.  Also 
n 

define  x  to  be  a  Guard  whenever  x  Is  a  Scout.  We  shall  think  of 
n+1  n 

each  as  a  part  of  a  convoy  with  a  Scout  ranging  ahead  of  it  and  a  Guard 

bringing  up  the  rear,  and  our  last  task  will  be  to  show  that  alternate  convoys 


converge  aonotonically  to  £  from  opposite  sides.  See  Fig.  13. 


<Sr  •/*'+**'%  &jiM>rJi  i 


n«s2k»2~ 
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Proposition  9:  Suppose  x  and  x  are  consecutive  Scouts  with  m  >  n; 

y\  fft 


then  m  >  n  + 2  and  x  , .  U  ,  X  , , .  C ,  U  ,  X  , ,  are  displayed  here 

—  Pi— 1  PI 

in  strictly  monotonic  order. 

See  Fig.  14. 

Proof:  For  definiteness  assume  x  <  x  ,  whereupon  it  follows  that 

n— 1  n 


x  ,  <  y  <  x  < 

n-1  “  n  —  n 


<  x  , .  Moreover  we  know  that  the  Scout  x  is  followed 
n+1  n 


bv  an  odd  number  of  Variances  x  . , ,x  x  and  then  the  next 

n+1  n+2  n+2*:+l 

y  <  x  <  a ...  „  < 

—  n  n+2 

We  also 


Permanence  x  so  "?  >  n+21c+2  >  n+2  and  x 

n+2<+2  -  - 


rn+4 


<  Xn+2Jc  < 


“n+2l:+2  <  “'n+2*c+l 


n-1 
<  x 


'n  —  n 
<  x 


n+3  ~n+l  ’ 

know  that  an  even  number  of  Variances  separates  every  two  consecutive 

Permanences  between  x  ,  and  x  ,  so  proposition  7  implies 

n+2<+2  n 


*  <  r  <  x  <  x  <x  : 

■n  in-1  —  n+2fc+l  —  n+1 

and  then  x  <  Z  <  x  <  v  <x  .<x  ,  .  The  onlv  question  left  is  whether 

”+■1  t  —  J>n  m-1  —  n+1 

or  not  <  x^.  If  not.  if  instead  x^  -  .V^J  <  <  C  <  <  *n+1 

-  .Vfy  7,  then  the  Mo  Crossover  condition  satisfied  bv  .V  would  be  violated 
n 

contrary  to  the  theorem’s  hypotheses.  Therefore  proposition  9  is  proved 
and  more; 


“n-1 


y  K  2 

“  n  -  n 


and  y 


<  X-n+l 


<  x  <  y  <  x 


-1  -  Xn+1 


Proposition  10:  If  the  sequence  of  Secant  Iterates  x^  contains  infinitely 

many  Scouts  then  a  subsequence  of  Cuards  converges  to  I,  and  all 

x  -*  Z  if  V  is  continuous, 
n 

Proof:  Let  the  Integer  sequence  m(l)  <  m(2)  <  nfli  <  •••  characterize 

consecutive  Scouts  x  ...  Cuards  x  ,  and  convov's  contents  w_,. 

Assuming  for  definiteness  that  x_  ^  <  C.  we  infer  from  proposition  9 

et  seq.  that  for  j  ■  1,2,3,... 
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Xm(2j)+l  -  xn(2j+l)-l  <  yn(2J+ lj 


-  *m(2j)  K 
<  x  ,  • 


Xm(2j)-1  -  *m(2j-l)+].  and 


3y  tr»ductlon  it  follows  that  y  ,  Increases  to  a  limit  u  and  u  ,,  . , 

wi  A7+1  >  2j  J 

decreases  to  a  limit  j  >  C  >  },  and  the  same  is  true  respectively  for  the 
Guards  ^yj+1  *  P  and  -  *.  as  J  (The  Scouts 

and  x  need  not  form  monotonic  subsequences.) 

"I!  •^,’+1 

The  possibility  that  y  <  Z  <  y  can  be  ruled  out  because 
y  “  *  C  would  imply  y  -  llmx^  ,.,+1  -  1  im  *  u  fi(y) 

and  similarly  y  ■  S(y)  in  violation  of  the  No  Swap  condition  satisfied 
by  .V.  And  if  ,V  is  continuous  at  C  a  similar  argument  shows  y  ■  £  ■ 
but  in  this  case  we  soon  infer  that  all  iterates.  Scouts  included,  are 
squeezed  towards  Z  by  the  Guards  and  hence  converge  to  Z • 

Propositions  6,  8  and  10  exhaust  all  possible  ways  for  the  sequence 
of  Secant  iterates  to  behave,  and  hence  prove  the  theorem.  □ 


Example:  In  this  example  f(x)  is  infinitely  differentiable  throughout 

I  =  ‘"I  I-i'l/d  “  1/1 n  2)  ■2.25889...},  vanishes  only  at  x  ■  Z  =  0 , 
and  is  elsewhere  in  I  strongly  monotonic.  Newton's  iteration  converges 
from  every  x^  in  I,  but  the  Secant  iteration  suffers  from  an  oscillation 
in  which  every  third  iterate  x  “  "1  though  the  remaining  iterates 

'.X.j^  and  x^_^}  converge  to  C.  The  construction  of  f  is  complicated 
enough  that  only  an  outline  can  be  presented  here. 

Start  by  defining  for  n  ■  0,1,2,...  the  descending  sequences 

5  1/1n<'~2>  *"'i  C3~L  =  W/2'  “U'  5  -  *U 

n  >  0.  Next  define 


■ 


> 


*(Z)  =  z  expa/cj  for 

=0  at  z  •  o  , 


<  Z  <  0  . 
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-F  )/2 
In  ^3n+3;/ 

♦  S  o([2Z-Z, 


"3n+l  -  ^  -  ;3r  ' 
'3rj  '  ^3n+l'^3n+3^ 


r  <  r  <  r 

^3n+3  -  *•  -  t;3r:+l 


where  ^  =  {*<1^)  +  /*  . 

3n  5  (*rCJn+lJ  ~*(Z3n+3J)/2  •  and 
af07  =  tanhftan('^r0/277  if  -1  <  9  <  1  , 


i  signi 67  otherwise  . 


Finally  f'x)  =  ( l-tsr)0fx/ ( 1+x) )  and  x  =  Z  /(l-£  )•  The  tedious  verifi- 

n  n  n 

cation  of  the  claims  made  above  for  f  are  left  to  the  reader. 


The  theorem  that  has  Just  been  proved  does  not  yet  render  Newton’s 
Iteration  obsolete.  Rather  It  supplies  a  powerful  Incentive  for  replacing 
Newton's  by  the  Secant  Iteration  In  those  cases  where,  as  In  §8's  Application  1, 
the  calculation  of  a  derivative  appears  to  confer  no  advantage.  Many  cases 
remain  to  be  analyzed;  for  Instance,  we  cannot  yet  say  whether  the  Secant 
Iteration  works  acceptably  well  on  that  rational  function  In  §4  with  inter¬ 
lacing  poles  and  zeros  on  which  Newton's  Iteration  works  so  well. 

A  final  warning;  computer  programs  based  upon  Newton's  or  the  Secant 
Iteration  rarely  use  an  Iteration  in  its  pristine  fora.  Programs  usually 
Incorporate  extra  "features"  which  modify  the  Iterations  in  wavs  that  their 
designers  hope  will  effect  some  improvements.  Sometimes  those  features  do 
yield  an  improvement,  sometimes  not;  they  almost  always  undermine  the  fore¬ 
going  analysis. 
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